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Preface 


This book is intended both for number theorists and more generally for 
working algebraists, though some sections (notably §15) are likely to be 
of interest only to the former. It is largely an account of mainstream 
theory; but for example Chapter 3 and §20 should be seen as illustrative 
applications. 

An algebraic number field is by definition a finite extension of Q, and 
algebraic number theory was initially defined as the study of the properties 
of algebraic number fields. Like any empire, its borders have subsequently 
grown. The higher reaches of algebraic number theory are now one of the 
crown jewels of mathematics. But algebraic number theory is not merely 
interesting in itself. It has become an important tool over a wide range 
of pure mathematics; and many of the ideas involved generalize, for ex- 
ample to algebraic geometry. Some applications to Diophantine equations 
can be found among the exercises, but there has not been room for other 
applications. 

Algebraic number theory was originally developed to attack Fermat’s 
Last Theorem — the assertion that X* + Y" = Z” has no non-trivial 
integer solutions for n > 2. It provided proofs that many values of n are 
impossible; some of the simpler arguments are in §13. But it did not provide 
a proof for all n, though recently the theorem has been proved by Andrew 
Wiles, assisted by Richard Taylor, by much more sophisticated methods 
(which still use a great deal of algebraic number theory). There are still 
respectable mathematicians seeking a more elementary proof, and this is 
not a ridiculous quest; but even if a more elementary proof is found, it is 
almost bound to be highly sophisticated. 

There are two obvious ways of approaching algebraic number theory, 
one by means of ideals and the other by means of valuations. Each has its 
advantages, and it is desirable to be familiar with both. They are covered 
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in Chapters 1 and 2 respectively. In this book I have chosen to put the 
main emphasis on ideals, but properties which really relate to local fields 
(whether or not the latter are made explicit) are usually best handled 
by means of valuations. Chapter 3 then applies the general theory to 
particular kinds of number field. The first two chapters (perhaps omitting 
§9), together with the easier parts of Chapter 3 and the first half of the 
Appendix, would form a satisfactory and self-contained one-term graduate 
course. 

Though §9 is more advanced than the rest of Chapter 2, its logical home 
is there; it is needed in Chapters 4 and 5, and introduces language which is 
widely used across number theory. The somewhat peripheral §12 depends 
on the results stated in §14 and proved in §15 of Chapter 4, as do parts 
of §13.1, and thus they are not in the correct logical order; but there are 
advantages in collecting all the information on special kinds of number field 
in a single chapter. 

There are important results which, though not in appearance analytic, 
can as far as we know only be proved by analytic methods. Indeed it has 
been said: ‘The zeta function knows everything about the number field; we 
just have to prevail on it to tell us.’ Some of what it has already told us 
can be found in Chapter 4. 

The more advanced parts of the algebraic theory are generally known as 
class field theory; most of the proofs involve Galois cohomology, either 
openly or in disguise. Anyone who writes a book on algebraic number 
theory is faced with a dilemma when he comes to class field theory. Most 
authors stop short of it; but working algebraists ought to know the main 
results of class field theory, though few of them need to understand the 
rather convoluted proofs. I would think it wrong to make no mention 
of class field theory; but to have included the proofs and the necessary 
background material would have doubled the length of the book without 
doubling its value. In consequence §§17 and 18 present an exposition of 
class field theory without proofs. In §19 we deduce the general reciprocity 
theorems, which are the simplest major applications of class field theory. In 
addition, §20 contains a proof of the Kronecker-Weber Theorem that every 
abelian extension of the rationals is cyclotomic; it is this result which made 
the general structure of classical class field theory plausible long before it 
was proved. The proof of the Kronecker-Weber Theorem is also rather 
convoluted, but it illustrates most of the ideas in the first two chapters. 

The reader needs to know the standard results about field extensions 
of finite degree, including the relevant Galois theory. The properties of 
finitely generated abelian groups and lattices, and of norms and traces, are 
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described in §A1.1 and §A1.2; most readers will already know these results, 
but those who do not will need to start by reading these two subsections. 
The existence of Haar measure and the Haar integral (described without 
proofs in §A1.3) is a fact which all working mathematicians should know, 
though again they have no need to study the proofs. Indeed, the main use 
of the general theory is to provide motivation and guidance; in any particu- 
lar case one can expect to be able to define explicitly an integral having the 
required properties, and thereby evade any appeal to the general theory. 
The status of §A2 is rather different. The Galois theory of infinite exten- 
sions is not actually needed anywhere in this book; but anyone who uses 
the results in Chapter 5 may need to consider field extensions of infinite de- 
gree. The remaining subsections of §A2 cover (without proofs) characters, 
duality and Fourier transforms on locally compact abelian groups; these 
are prerequisites for §15, but also for much of advanced number theory. 

The book concludes with a substantial collection of exercises. Others 
can be found in the text; see the index. The latter are results which are 
too peripheral to justify the provision of a detailed proof but which may be 
interesting or useful to the reader. Each of them is provided with ‘stepping- 
stones’: intermediate results which are individually not too difficult and 
which should enable the reader to construct a complete proof. 
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Numbers and Ideals 


1 The ring of integers 


Unless otherwise stated, throughout this book K and k will be algebraic 
number fields, even though some results hold more generally. The ring of 
integers of k (yet to be defined) will be denoted by o or o;; the ring of 
integers of K will be O or Ox. What are the properties which one would 
like the integers of k to have? Some obvious ones are the following:- 


1. 0, is a commutative ring. 

2. 0, Q = Z, so that the integers which are rational are just the rational 
integers. 

3. 0% @z Q =k, so that each a in k can be written as c3 where c is in Q 
and @ is an integer in k. 

4. If ais in Q, the algebraic closure of Q, the property that a is an integer 
only depends on @ and not on the field in which we are working; 
in other words, 


0, = kM {integers of Q}. 
5. If a and a’ are conjugate over Q and a is an integer, then so is a’. 
gs 


There is a largest subring of k satisfying these conditions, but no smallest 
one; so we shall choose 0, to be the largest such subring. It follows from 1, 
2 and 5 that if a is an integer then its monic irreducible polynomial over Q 
has coefficients in Z. We shall say that a in Q is an algebraic integer if 
it satisfies one of the three equivalent conditions in the following theorem. 


Theorem 1 Let a be an element of Q; then the following conditions on a 
are equivalent: 


(i) Zla] is a finitely generated Z-module; 
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(ii) @ is a root of a monic polynomial with coefficients in Z; 
(iii) the monic minimal polynomial for a over Q has coefficients in Z. 


Proof It is trivial that (iii)=(ii). To prove (ii)=(i), let m be the degree of 
the monic polynomial given by (ii); then a” is in the Z-module generated 
by l,a,...,a@~!, whence a is in the same module for all 4 > m. To 
prove (i)=>(ii) choose elements f,(a),..-, fn(a@) which span the Z-module 
Za], where the f,(X) are in Z[X]. For any N > 0 there is a relation 


aN _ AA) =0 


where the b, are in Z, and if N is greater than the degree of any of the f, 
this is the monic equation required. 

It remains to prove (ii)=>(iii). Let f(X) with coefficients in Z be the 
monic polynomial for a given by (ii), and let g(X) be the monic minimal 
polynomial for a over Q. There is a monic polynomial h(X) with coef- 
ficients in Q such that f(X) = 9(X)h(X). IE (iii) were false then there 
would be a prime p which divided the denominator of at least one of the 
coefficients of g(X). Let p* with u > 0 be the greatest power of p which 
divides any of the denominators of the coefficients of g(X) and let p” with 
v > 0 be the greatest power of p which divides any of the denominators of 
the coefficients of h(X). Let F, denote the finite field of p elements. In 


p’*" f(X) = {p"9(X) }{p"h(X)} 


all the coefficients have denominators prime to p; reducing mod p we obtain 
an expression for 0 as the product of two non-zero polynomials in F,[X]. 
This is impossible; hence (iii) must be true. | 


With this definition requirements 2 to 5 are trivial. To prove 1, let a, 3 be 
algebraic integers, so that there are corresponding polynomials satisfying 
(ii) with degrees m,n respectively. Thus 1,a,... ,a@”—! span the Z-module 
Z{a] and similarly for 3; hence the a4B” withO <up<m,0<v <n span 
the Z-module Z{a, 3]. Since it is finitely generated, so are its submodules 
Zla + A] and Z[af]; so a+ GB and af are integers. 


Lemma 1 [If k is identified with Q” where [k : Q] =n, then ox is a lattice 
ink. 


Proof Let o,,...,Q, be a base for k as a Q-vector space. We can find 
m, #0 in Z such that the 6, = m,a, are in 0;; so 0, spans k. For any @, 
in Q write € = >> f,a,. If 0, is not discrete in k then there are arbitrarily 
small ¢,,...,£, such that € is non-zero and in o;; hence norm,;aé is in 
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Z, and it is non-zero because it is the product of conjugates of &. But 
norm = ¢(£,,... ,£,) where ¢ is a homogeneous polynomial of degree n 
with coefficients in Q; so we can choose the @, so small that |¢|]<1. 


More generally, an order in k is defined to be any subring of 0, which 
contains 1 and has finite index in 0, considered as a Z-module. Equiva- 
lently, an order is a subring of 0, which satisfies the analogues of 1, 2 and 
3. It is easy to write down some orders in k; for example, if n = [k : Q] 
and @ is any integer in k such that k = Q(a), then the Z-module spanned 
by l,a,... ,@"~! is an order in k. 

Let R, C Re be commutative rings with a 1. We shall say that R, is 
integrally closed in R, if a in Rg, all 6, in Ry and 


a” + Ba" +--.4 6, =0 


together imply that a is in Ry. 


Lemma 2 The ring of integers in Q is integrally closed in Q. 


Proof We argue as in the proof of 1 on page 1. In the notation above, the 
Z-module Z{(1,... , Gn] is finitely generated because it is contained in the 
ring of integers of Q(Gi,...,8n); suppose the finite set S spans it. In an 
obvious notation S,Sa,... ,Sa”~} span Z[a, B1,... , Bn]; hence the latter 
is finitely generated and so is its submodule Z/a]. oO 


The identification of k with Q” which we used in Lemma 1 does not 
enable us to compute a meaningful measure for k/o,, because there is no 
natural measure on the Q” in Lemma 1 which is not derived from ox. But 
there is an identification of k @q R which does achieve this. Let o, run 
through the n embeddings k — C. Of these, there are r, embeddings into 
R and r2 pairs of complex conjugate embeddings whose images are not in 
R. Here rj + 2rg = n; and r1,r2 will almost always have these meanings. 
Denote temporarily by o : k + C” the map given by a+ (o1a,... , Ona) 
and by V the R-vector space ck ®q R; since o,k C R if a, is real, and 
(o, x @)k is contained in a space R? if o,,@> are complex conjugate, 
dim V < n. But 0, — oo, is an isomorphism because it has trivial kernel, 
so oo, is a free Z-module on n generators; and cox is discrete in C” and 
hence in V, for the same reason as in the proof of Lemma 1. So co, is a 
lattice in V, and dimV =n. 

The inclusion V Cc C” induces a canonical measure of volume in V. The 
measure of V/co is finite and non-zero; it tells one how sparse the integers 
of k are. With a suitable choice of the canonical measure on V, the volume 
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of V/co is | det M|, where ay,... ,@» are a base for 0, and M is the matrix 
of the o,a,. Unfortunately det M is only defined up to sign, and is not 
necessarily in either R or k; so instead we consider 


dy = (det(M))? = det(‘MM) = det(Trkq (a,av)). 


This is called the absolute discriminant of k. Observe that M has ro 
pairs of complex conjugate rows, so its determinant is i7? times a real 
number; thus the sign of d, is (—1)". 

More generally, let 6,,... ,8n in 0, be linearly independent over Z, and 
let m be the index in ox of the Z-module generated by (1,... , Gn. If 


A?(61,--. Bn) = det(Tr(,,,)) (1) 


then a comparison of the last two displayed equations gives A? = m?d,. 
Conversely, if we start from linearly independent integers (1,..., 8, and 
try to find a base for o,, the fact that d, is an integer restricts us to finitely 
many possible m. 

We can further generalize (1) by considering K > k with [K : k] =n. 
Let a1,...,Qn be a base for K as a k-vector space, and write 


Nic/p (1+ Qn) = det(Trx/~(a,av)). 


Then A}, ,, is non-zero. For otherwise there would be a non-zero column 

vector {€1,..- ,&n) killed by the matrix (Trx,/;(@,@v)), where the €, are in 

k. Set y = >> &a,; then Tr(a,y) = 0 for each p and therefore Tr(ay) = 0 

for each a in K. But this cannot be true for a = y~!. This argument, and 

the two lemmas which follow, are valid for any field k of characteristic 0. 
The following result will be needed in §8. 


Lemma 3 Let K,k be algebraic number fields with K C k; then every 
k-linear map K — k is given by a+> Trx x(a) for some B in K. 


Proof Call this map ¢g. The k-linear map from K to the dual space of K 
given by 6 +> ¢g has trivial kernel. The two spaces involved have the same 
dimension as k-vector spaces, so the map is an isomorphism. O 


The following result, which will be needed in §13.1, is known as Hilbert’s 
Theorem 90; in highbrow language it states that a certain cohomology 
group is trivial. 


Lemma 4 Let K/k be a Galois extension whose Galois group Gal(K/k) 
is cyclic with generator o. If a in K is such that normx;,a = 1 then 
a= B/oB for some B in K; and we can take B to be integral. 


1 The ring of integers 5 
Proof Let [K : k] =n and for any y in K consider 
B=y-atoy-a:cat--++(o™!y)(a-:-0" 1a); 


then a-oG = 8. If 6 = 0 for every y then A(T ,Yn) = 0 for any 
Y1,+++,%n in K, and this we know to be false. We can make ( integral by 
multiplying it by a suitable element of o,. O 


One curious property of the absolute discriminant is the following. 


Theorem 2 (Stickelberger) We have d, = 0 or 1mod 4. 


Proof Write n = [k : Q], let a1,... ,Qn be a base for ox, and write 


A=) (Tes) B=)" (Tes 
all x ” m odd yb 

where 7 denotes a permutation of 1,... ,n. We have det M = A-2B and 

therefore d, = A? + 4(B? — AB). Both A and B are algebraic integers, 

and A is rational by Galois theory; hence B? — AB is also rational and is 

therefore a rational integer. Hence d, = A? = 0 or 1mod 4. oO 


The relation between k and 0, which we have been discussing is a special 
case of a much more general one. The results which we need are not much 
harder to prove in the general case, provided one takes the assertions in 
the right (somewhat unnatural) order. Until the end of this section, we 
therefore consider any pair of commutative rings R > o having a common 
identity element 1. (In the applications o will be an integral domain and 
R a field; but we do not need to assume as much as this.) For any a in R 
consider the following three statements: 


(i) ofa] is finitely generated as an o-submodule; 

(i)’ ofa] is contained in a ring R, C R which is finitely generated as an 
o-submodule; 

(ii) a@ is a root of a monic polynomial with coefficients in o. 


I claim that these statements are equivalent. Clearly (i) implies (i)’, and 
(ii) implies (i) as in the proof of Theorem 1. So assume that (i)’ holds 
and that (1,...,8n span Ry as an o-module. We have a8, = )> Yup for 
some 7, in 0; so a is a root of det(zJ — IT) = 0 where I is the matrix of 
the yy,,. This is just (ii). 

Denote by 0 the set of all a in R satisfying any of these conditions. An 
argument like that which follows the proof of Theorem 1 shows that 0 is 
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a ring with o C O C R. Similarly an argument like that in the proof of 
Lemma 2 shows that 0 is integrally closed in 2. If o is an integral domain, 
let k be its quotient field and assume R 3 k. In this case a is in D if and 
only if 


(iii) the monic minimal polynomial for a over k has coefficients in OM k. 


Since 0 is integrally closed in R, (iii) implies that @ is in D. Conversely, if 
(ii) holds then a satisfies some monic equation f(X) = 0 with coefficients 
in 0; hence so does ca where g is any embedding k(a) — k. Thus each ca 
is in the integral closure of o in k. Whatever the characteristic, the monic 
minimal polynomial for a@ over k has the form 


9(X) = [](X - oa)” 
ios 
for some N > 0; and the coefficients of g(X) lie both in & and in the 
integral closure of o in k, so they lie in OMk. Note that we need not have 
0 = ONk; if for example o = Z{[/—3] and R = Q then a = 3(1+ Y—3) is 
in O because it satisfies a? -a+1=0, and in fact ON k = Z[al. 

The unnatural looking criterion (i)’ in the preceding discussion has been 
included because an o-submodule of a finitely generated o-module need not 
be finitely generated. (See Exercise 1.6.) This is frequently inconvenient; 
so if,o is a commutative ring with a 1 we say that an o-module M is 
Noetherian if every o-submodule of M (including M itself) is finitely 
generated, and we say that o itself is Noetherian if 0 is a Noetherian 
o-module — in other words, if every ideal of o is finitely generated. Any 
o-submodule of a Noetherian module o is Noetherian; but a subring of a 
Noetherian ring need not be Noetherian. 

We say that an o-module M satisfies the ascending chain condition if 
every increasing sequence M, C M2 C --- of o-submodules of M stabilizes 
— that is, if the sequence is eventually constant. 


Lemma 5 The three following conditions are equivalent: 


(i) M is Noetherian; 

(ii) M satisfies the ascending chain condition; 

(iii) every non-empty family of o-submodules of M contains mazimal ele- 
ments. 


Proof (i)=>(ii). Suppose that M, C M2 C --- is an increasing sequence of 
o-submodules of M; then M! = UM, is an o-submodule of M. Hence it 
is generated by a finite set S of elements of M*. If My contains all the 
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elements of S, then M# C My; so the increasing sequence is constant from 
M N ON. 

(ii)=>(iii). If (iii) were false we could construct a strictly increasing se- 
quence M, C M, Cc --- of o-submodules of M. For suppose we have chosen 
M;,...,M,. The o-submodules of M which contain M, form a non-empty 
family, and this family contains no maximal elements; so we can choose an 
My41 which strictly contains M,. This contradicts (ii). 

(iii) (i). Suppose that M contains an o-submodule N which is not 
finitely generated. Let S be the set of all finitely generated o-submodules 
of N, let Mp be an element of S and let € be an element of N not in 
Mo. Then Mp is not maximal in S because S contains the strictly larger 
o-submodule of N generated by Mp and &. O 


Lemma 6 Let L, M,N be o-modules such that the sequence 
0-L—-~M—=N-—0 
is exact. Then M is Noetherian if and only if L and N are. 


Proof If M is Noetherian then L is a submodule of M and therefore also 
Noetherian; and if N; is a submodule of N then its inverse image in M is 
finitely generated and hence so is Nj. 

Conversely, if L and N are Noetherian let 96: L — M andy: M—o4WN 
be the maps in the exact sequence. Let M, C M2 C --- be an increasing 
sequence of o-submodules of M. The increasing sequences {¢(L) 9 M_} 
and {i(M,)} consist of o-submodules of ¢(Z) + L and N respectively, so 
they both stabilize — say by the n-th term. If now v > n then 


o(L) NM, = o(L)N M,, = L* 


say, and M,/L* =~ ~(M_) = ¥(M,) = M,/L*; so M, = M,, and the 
sequence {M,} also stabilizes by the n-th term. O 


Theorem 3 Ifo is Noetherian then an o-module is Noetherian if and only 
if it is finitely generated. 


Proof It follows from Lemma 6 by induction that the direct sum of finitely 
many copies of o is Noetherian. But any finitely generated o-module is a 
homomorphic image of such a direct, sum; so it too is Noetherian. O 


To study Noetherian modules over a non-Noetherian ring would be eccen- 
tric, so Theorem 3 reduces us to the study of Noetherian rings. Let 0 — o’ 
be a surjective homomorphism of rings and assume that o is Noetherian. 
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Then o’ is a Noetherian o-module, by Theorem 3, so all its ideals are finitely 
generated as o-modules. Hence they are finitely generated as o’-modules; so 
o’ is Noetherian. In other words, the homomorphic image of a Noetherian 
ring is Noetherian. 

Let 0 be a ring containing 0; we shall say that 0 is finitely generated 
over o as a ring if D = o[a1,... ,@»] for some elements a1,... ,Qp in O. 


Theorem 4 (Hilbert Basis Theorem) If 0 is Noetherian and 0 is 
finitely generated over 0 as a ring, then D is Noetherian. 


Proof We prove first that if X is transcendental over o (that is, if X is not 
a root of any polynomial with coefficients in 0) then o[X] is Noetherian. 
For let J be an ideal in o[X]. The set M of leading coefficients of elements 
of J is an o-ideal. Let B be a finite set which spans M as an o-module 
and for each element of B choose a polynomial in J with that element as 
its leading coefficient. Let S be the set of these polynomials, and let N 
be the largest of their degrees. For each n with 0 < n < N let M, be 
the o-ideal of the coefficients of X” in polynomials in J of degree at. most 
n, and let B, be a finite set which spans M,,. For each element of By, 
choose a polynomial in J of degree at most n with that element as the 
coefficient of X”, and let S, be the set of these polynomials. I claim that 
J is spanned as an o[X]-module by the set S! which is the union of S arid 
the S, with 0 <n < N. For if not, among the elements of J which do not 
lie in the o[X]-module spanned by S*, let f(X) be one of lowest degree. If 
deg f(X) 2 N, let the leading coefficient of f(X) be a = }> Ba, where 
the 6, are in o and the a, are elements of B. Let f,(X) be the element of 
S whose leading coefficient is a,,. Then 


P= YB XO Meh F(X) 


is an element of J which has degree less than that of f(X), so it is in the 
o[X]-module spanned by S#. Hence so is f, which is a contradiction. If 
deg f(X) = n < N then a similar argument works, using the elements of 
S, instead of those of S. 


If X,,...,Xm are independent transcendentals over o, it follows by in- 
duction that o[X1,...,Xm] is Noetherian. Finally, if O = o[a1,... ,am] 
is any finitely generated ring over o then it is the image of 0[X),... , Xm] 
under a suitable homomorphism; so it too is Noetherian. Oo 


If we know some Noetherian rings, Theorem 4 enables us to generate 
many more. But any principal ideal domain is Noetherian; in particular Z 
and all fields are Noetherian. For our purposes what matters is that 0, is 
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Noetherian whenever k is an algebraic number field, because 0, is finitely 
generated over Z by Lemma 1. 


2 Ideals and factorization 


Let 0; be the ring of integers of k. Unfortunately we do not in general have 
unique factorization in o,; for the standard example see Exercise 1.2. Of 
the various ideas that have been introduced to alleviate this situation, two 
have turned out to be valuable: these are ideals (described in this section) 
and valuations (described in Chapter 2). 

Of the key theorems about ideals in 0,, Theorems 5, 6 and 7 below are 
usually proved in a more general setting — that of Dedekind domains. The 
disadvantage of this approach is that it involves one very opaque proof — 
in our case, that of Theorem 5. In the exercise at the end of this section 
the reader will find a simpler approach, but one whose validity is confined 
to algebraic number fields. 

A Dedekind domain is an integral domain o with a 1 such that 


(i) o is Noetherian and integrally closed in its field of fractions, 
(ii) every non-zero prime ideal of o is maximal. 


Denote the quotient field of o by k. The following lemma shows that we 
cannot get rid of non-principal ideals by replacing o by a smaller ring having 
the same quotient field k. We can do so by replacing o by a slightly larger 
ring R (see Exercise 1.10) and this is sometimes useful; but there is a price 
to be paid. In particular, we cannot expect FR to satisfy the criteria at the 
beginning of §1. 


Lemma 7 Any principal ideal domain is Dedekind. 


Proof Let o be a principal ideal domain, and therefore Noetherian. Suppose 
that @ in k is integral over o and write G = a,/a2 with a1,a2 ino. We 
can assume that (a@,,a2) = (1); for if (a1,a2) = (y) with y not a unit, 
we can divide a; and ag by y. If 6? +6"! 4+---+c, = 0 where 
the c, are in o then af + crap ‘ag +--:+ craze = 0. It follows that 
(a2) = (at,a2) D (a1, a2)” = (1), so that ag is a unit and @ is in o. 
Now let (a) be a non-zero prime ideal of o and let (3) be a maximal ideal 
containing (a). Thus a is in (@) and hence equal to Gy for some y in o. 
But (q) is prime, so one of G, y must be in (a). If G is in (a) then (8) C (a) 
so that (a) is maximal; but if y = ad then a = Bad whence $5 = 1, and 
then (8) = 1 which is forbidden. O 
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By a fractional ideal a we mean a finitely generated o-submodule of k. 
(To avoid ambiguity, an ideal in 0 sometimes has to be called an integral 
ideal.) If a1,...,@m Span a as an o-module and a, = 6,/7, with y,,5, 
in o, then ya C o where y = [| 7. Hence any fractional ideal has the form 
cb, the set of c@ where c is a fixed element of k and @ runs through the 
elements of some ideal 6. Conversely, any such set cb is a fractional ideal. 
All the obvious rules extend from ideals to fractional ideals. 


Theorem 5 The non-zero fractional ideals of a Dedekind domain form a 
multiplicative group. 


Proof The only difficulty is to prove the existence of inverses. The proof 
proceeds through a sequence of assertions. 


e Ifa is a non-zero ideal then pip2---pm C a for some prime ideals p,,. 


Suppose the assertion is false; then because o is Noetherian we can choose 
a maximal among the ideals which do not have this property. Since a is 
not itself prime, we can choose (3, G2 in o but not in a such that (; (> is in 
a, Write b, = (a, 6,); then each 6, strictly contains a, so by maximality 
it contains a product of prime ideals, and hence so does a D bbe. 


e Every non-zero prime ideal p is invertible. 


Let p~ be the o-module of elements a in k such that ap C 0; we shall show 
that p~ is the inverse of p which we are looking for. Choose 8 # 0 in p; 
then Bp~ C 0 is an o-module and hence an ideal; so p~ is a fractional ideal. 
Since p~ > 0, we have o D p7p 2 p; and because p is maximal, we must 
have equality in one inclusion or the other. If we have equality in the first 
inclusion, p~ is the inverse of p which we are looking for; so we assume 
that p = p~p and derive a contradiction. Choose m minimal so that there 
exists a product 


PiP2--'Pm C (8) Cp, 


where @ is as before. One of the p,, say pi, must be contained in p and 
thus equal to p; for otherwise for each y we could choose a, in p, but not 
in p, and [[ a, would not be in p. By the minimality of m, po--- pm ¢ (B) 
and hence there exists y in p2---pm but not in (8). Now yp C (8), so that 
6 = 7/8 is in p~ but not in o. Thus dp C pp = p; and if {(1,... , Ba} 
spans p as an o-module then 68,, = 57 Yy.(y for some ¥,, in o. This implies 
det(6J — I‘) = 0 where I is the matrix of the y,,,; and since o is integrally 
closed in k it follows that 6 is an integer, which is a contradiction. 
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e Every non-zero integral ideal is invertible. 


If not, there would be a maximal non-invertible ideal a. Among the ideals 
containing a there is one which is maximal and therefore prime; denote it 
by p. Thus a C p~!a C o. If a = p7!a then an argument like that for 
the previous displayed statement shows that p~! C 0, which would imply 
o=pp-! cp. Soa #4 p~ta. By maximality p~'a has an inverse b, and 
bp~! is an inverse for a. 

Finally, any fractional ideal ca has an inverse c~!a~?}. Oo 


Corollary For any non-zero ideals a,,a2 the four assertions a, C aa, 
ajaz} C0, aya D0 and aga; are equivalent. 


Theorem 6 Let 0 be a Dedekind domain. Any non-zero integral ideal a in 
o can be written as a product 


a=fi-''Pm : (2) 


where the p, are prime ideals, and this expression is unique up to the order 
of the factors. 


Proof If there are non-zero a with no decomposition (2), we can assume 
that a is maximal among them. There exists a maximal ideal p D a, so 
p~a is an integral ideal strictly containing a; for p~'a = a would imply 
p~! = 0 on multiplying by a~!. Hence p~1a is a product of prime ideals, 
and if we multiply this product by p we get a product for a. 

Suppose that there are two essentially different factorizations of some a. 
Among such ideals a and their factorizations let 


a= Pie's Pm = 41 -° dn 


be the formula with the least value of m. We cannot have m = 0, because 
then a = (1) and so also n = 0. If p; were not among the q,, we could 
for each v find a, in q, but not in p, and [][a, would be in a but not 
in p,, which gives a contradiction. So we can remove a factor p; in both 
expressions for a, and m is not minimal. O 


Corollary Leto be a Dedekind domain with quotient fieldk. Any non-zero 
fractional ideal a in k can be written in the form 


& = Pi--*Pm/qi--* Gn (3) 


where the py,qu are prime ideals and no p,, is equal to any q,; and this 
expression is unique up to the order of the factors. 
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Proof We have a = (71/72)6 for some integral 6 and some non-zero 71,72 
in o; applying the theorem to (¥1), (72) and 6 gives an expression (3), and 
we can remove any factors common to the numerator and denominator. 
Suppose that there were a second such expression 


A= Pp Pi /41°° Ia 
then we would have 


and since no p, is a q, and no pi, is a q,, it follows from the uniqueness 
clause in the theorem that the pi,, are a permutation of the p,, and the q,, 
a permutation of the q,. O 


The theorem which follows, and also Corollary 1 to it, are each known 
as the Chinese Remainder Theorem. 


Theorem 7 Let a = [|] p.” be a non-zero ideal of 0; then the natural map 
@: 0 J] ](0/py") is onto and induces an isomorphism 


ofa [[(o/p2). 


Proof The kernel 6 of ¢ consists of the a which lie in each pj"; so b is 
the largest ideal divisible by each py”, by the Corollary to Theorem 5. 
Thus 6 = a. Since b; = p}?---p"™ is not divisible by p; and hence is not 
contained in it, we can find G; in by but not in py. Since p; is maximal, 
o/p; is a field, and hence we can find 7; in o whose image in o/p is the 
inverse of the image of 6,. Thus 6;y1 = 1 — 6; with 6; in py. Now 1-6}! 
is a multiple of 3; which is congruent to 1mod pj’. Hence the image of ¢ 
contains 1 x 0 x --: x 0. It also contains all the other similar expressions; 


so ¢ is onto. oO 

Corollary 1 Let a1,...,am be non-zero integral ideals coprime in pairs 

and let a1,...,Q@m be elements of 0. Then there exists a ino such that 
a=a,moda, (u=1,...,m). (4) 


Proof Write a, = [] put”; by hypothesis no prime ideal occurs more than 
once as a p,,, and each congruence (4) is equivalent to the finite set of 
congruences 


— nr 
a = a, mod pig’. 
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Now write a = [Ja,; the corollary just restates that the map ¢ in the 
theorem is onto. O 


Corollary 2 Let a,b be non-zero ideals; then we can find a ina such that 
(a)/a is prime to b. 

Proof Let p,,.-.,Pm be the prime factors of b. For each p = 1,...,m 
define n,, by py." ||a, and choose a, in pr” but not in prt”. By Corollary 
1 there exists a in o such that 


a =a,mod per foreach p» and a=0Omod (a/ |] pt); 


and this does what we need. Oo 


Corollary 3 Let a be a non-zero ideal and a’ a non-zero‘element of a; then 
there exists a ina such that a = (a,a’). 


Proof Apply Corollary 2 with b = (a’). oO 


Each of the last two corollaries extends to fractional ideals a. For we 
can find a non-zero c in o such that ca is integral. If y in ca is such that 
(y)/ca = (c7!y)/a is prime to b then a = c~!y meets the requirements of 
Corollary 2; and if 6 in ca is such that ca = (4,ca’) then a = c715 meets 
the requirements of Corollary 3. 

It follows from Theorem 7 that there is an isomorphism (though not a 
canonical one) 


p”"1 /p” = o/p (5) 


for any non-zero prime ideal p and any integer n > 0. For fix 9 in p"™~} 
but not in p”; the map 0 — p”~!/p” given by a ++ a@@ has kernel p, so 
we need only prove that it is onto. Given any 7 in p"~!, by Corollary 1 to 
Theorem 7 we can find 7; satisfying 


v1 =ymodp", 1 = Omod ((B)/p”~*). 


Now a; = 71/8 is an integer, because any prime power which divides 3 
also divides 71; and a, maps to 7 and hence to y mod p”. 


Now let k be an algebraic number field and o the ring of integers in k. 
It follows from Lemma 1 that any non-zero fractional ideal a is a lattice in 
k. We now define the absolute norm Norma of a; note the capital letter. 
Let V be the R-vector space k @q R, on which volume is defined up to 
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multiplication by an arbitrary constant. Both o and a are lattices in V, so 
we can write 
Norm a = vol(V/a)/vol(V/o). 
If a is integral, then 
Norma = [o : a]. (6) 
By convention, and despite this definition, we write Norm(0) = 0 where 
(0) is the zero ideal. If a 4 0 is in k and ay,... ,a@, are a base for a as a 


Z-module, then aa1,... ,@Q, are a base for aa. Using the volume defined 
on page 3, we deduce 


ee 


o 


Norm(aa) = Norm a = |norm,/qa| Norma (7) 


and in particular Norm((a)) = |norm,;qa|. We could instead have used 
(6) and (7) to define the absolute norm; this is more algebraic but clumsier. 


Lemma 8 /f a1, a2 are fractional ideals in k, then 
(Norm a;)(Norm az) = Norm(azag). 


Proof In view of (7) it is enough to prove this when the a, are integral and 
non-zero. If a = [] py” is integral, then by (5), (6) and Theorem 7 


Norma = I[l > Py] = [[(Norm Pu)”. 


The lemma now follows by factorizing a; and ag. O 


For some applications it is convenient to generalize o,, the purpose being 
to enable us to ignore certain ‘bad’ primes. If p is a prime ideal of 0,, we 
say that a in k is integral at p if a = a,/a2 where a1, a2 are in o, and 
a2 is not in p; provided that a # 0, this is equivalent to saying that p does 
not occur among the q, in the factorization (3) of (a). More generally, 
let S be any set of primes, and define os to be the set of elements of k 
integral at each prime outside S. (This notation must not be confused 
with the much more important o, introduced in Chapter 2.) It is easy to 
see that os is Dedekind. The ideals of og are just the as = a®, os where 
a is an ideal of 0; for we can recover one possible a from any ideal 6 of 
og because a = 6NMo implies 6 = as. To any fractional ideal a in k there 
corresponds the fractional ideal as = a @, os; but recovering a possible a 
from a fractional ideal for os is untidy and is best done by means of (3). 
The prime ideals of os are just the ps where p is a prime ideal of o not in 
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S; to derive from (2) a factorization of as we delete from the right hand 
side any primes in S and put a subscript S on the others. 


There is a general theorem that if o is a Dedekind domain with quotient 
field k, and if K is a finite algebraic extension of k and 0 the integral closure 
of o in K, then 0 is Dedekind; taking o,k to be Z, Q it follows that the 
ring of integers of any algebraic number field is a Dedekind domain. But 
the proof of this general theorem is tedious and there is an easy shortcut. 


Theorem 8 [fo is the ring of integers of an algebraic number field k then 
o is Dedekind. 


Proof We have only to prove that any non-zero prime ideal p in o is maximal 
— in other words that o/p isa field. It is an integral domain, so we only have 
to prove the existence of inverses. Let a be in o but not in p. Multiplication 
by a gives a map of o/p to itself with trivial kernel; since o/p is finite such 
a map must be onto, so the class of a has an inverse. O 


Since o/p is a finite field, it contains p/ elements for some rational prime 
p and some f = f, > 0. Clearly p > (p), so let p® be the exact power 
of p which divides (p). The primes with ep > 1 are called ramified; we 
shall see later that for each k there are only finitely many of them. We can 
factorize (p) as a finite product (p) = [] p*; taking Norms on both sides 
and writing n = [k : Q] we obtain 


n= > evfe (8) 


where the sum is taken over all p dividing a fixed p. The analogue for 
extensions K/k will be proved as the Corollary to Lemma 13. 


Let o be a Dedekind domain with quotient field k, and let J, be the 
group of non-zero fractional ideals of k. The ideal class group of k is 


C, = J;,/(group of non-zero principal ideals). 


If k is an algebraic number field and o its ring of integers, we shall show 
below that the order of J; is finite; it is called the class number of k and 
is denoted by h. There is an exact sequence 


0-08 —Sk* ok OC, 0, 


where o* is the group of units of o. For a general Dedekind domain, we 
can say nothing useful about the two outer terms. But if o is the ring of 
integers of an algebraic number field k (or even if o = og for some set S of 
primes in such a k), they have important properties. We consider units in 
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83; the main theorem about C, is Theorem 9 below, though it too will be 


refined in §3. 
Let [k : Q] = n and fix a base a1,... ,@p for 0, as a Z-module. Any § 
in k can be written 


B= bio, +---+b,a, with b, in Q; 
we define the height of 6 to be H(8) = >~|b,|, which gives a metric on k. 
Write C = max H(a,a,). If @ is in 0, then H({) is in Z, as is C. 


Lemma 9 Let a be a non-zero fractional ideal in an algebraic number field 
k and a a non-zero element of a of minimal height. For any B ina we can 
find y in o, and m in Z such that 


H(mB/a—y)<(C+1)71 and 0<m<M, (9) 
where M = (n(C +1))” +1. 


Proof If we multiply a, a and £ by the same integer N, we alter neither 
the hypotheses nor the conclusions; so we can assume that a is integral. In 
particular, H is integral on a, so its minimum is attained and a exists. We 
now use the pigeonhole principle. For any 7,, in o write 


mB/a- Im = of ay +. +el™a, with ef™ in Q; 


given m we can choose ym in o so that 0 < cl™ <1 for each v and thus 
determine a point P,, = (\™, er ci) in the unit cube. Partition this 
cube into M — 1 subcubes of side (n(C + 1))~!, each defined by 


ry/(n(C +1)) < ey < (ry +1)/(n(C + 1) 


for some integers r, with 0 < r, < n(C +1). Two of Py,..., Px must 
lie in the same subcube — say Pp, and Pp, with my < mg. Writing 
m=mM2—-™, Y = Ym. — Ym, We obtain (9). Oo 


Theorem 9 If k is an algebraic number field the order of Cy is finite. 
Proof For any € = )leyay, &’ = >> ea, in k we have 
Hee’) =H (yo eeeauay) 
< LD level lf (ayar) < CH()H(€) 
in the notation above. It now follows from (9) that 


H(m8 — ary) < CH(a)H(mB/a~ 7) < CH(a)/(C + 1). 
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But mf — ay is in a, and must therefore vanish because a has minimal 
height among the non-zero elements of a. It follows that (M4!)@ is a multiple 
of @ for every 8 in a. Now consider a, = (M!)a/a. It is an integral ideal 
in the class of a, and it contains M! because a contains a; so it is a union 
of cosets of (M!)o in o. But there are only finitely many ideals composed 
in this way. Oo 


The corresponding result for (0,)s5 as defined on page 15 follows at once, 
If a and 6 are in the same ideal class for 0, so are as and bg for os; so 
Theorem 9 for os follows from Theorem 9 for o. 


We shall obtain a much more realistic way of bounding the order of C, 
in Theorem 10 below, whose proof does not depend on Theorem 9. The 
real point of the proof above lies in the following exercise. 

Exercise Without assuming any of the earlier results in this section, de- 
fine a relation between non-zero fractional ideals a,a’ in a given algebraic 
number field k by 


a~a’ if there exist non-zero a,a’ in k such that aa = aa’. 


Prove that this is an equivalence relation, and that the argument of Lemma 
9 and Theorem 9 shows that there are only finitely many equivalence 
classes. Now proceed as follows. 


e If ab C ab for some non-zero 6, then a C (a). 


Let f1,...,8n be a base for 6 and for any a’ in a write A = a’/a. Since 
Ab C 6 there exist c,, in Z such that AB, = > ey,G,; so A satisfies 
det(AJ — C) = 0 where C is the matrix of the c,,. Hence X is an integer. 


e If ab = ab for some non-zero 6, then a = (a). 


Write a’ = a~1a, which we now know to be integral. Thus 6 = a’b, so that 
Bv = apf, with a,, in a’. Hence det(J —.A) = 0 where A is the matrix 
of the a,v; expanding this determinant shows that 1 is in a’. 


e For any fractional ideal a, there exists 6 4 (0) such that ab is principal. 


Among the a” with m > 0 there must be two in the same equivalence class 
— say a™ and a™ with m, < m2. Thus aja™! = aea™ for some a1, Q2, 
whence (a) = a2a™2—™! and we can take b = a™27™1—1. 

Now deduce the main results of this section for the special case when 
o = Ox, the ring of integers of an algebraic number field k. 
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3 Embedding in the complex numbers 


Let k be an algebraic number field with [k : Q] = n; in this section we 
exploit systematically the embedding k + R™ x C™ ~ R” already intro- 
duced on page 3. For convenience we order the embeddings o, : k + C so 
that o1,...,0,, are real and that o,,0,4,, are complex conjugate when 
ry <U <7, +172. We shall need the following elementary result. 


Lemma 10 Let A be a lattice in V ~ R” and let S be a bounded closed 
convex subset of V symmetric about the origin; then S contains a point of 
A other than the origin provided vol(S) > 2"vol(V/A). 


Proof Assume first that vol(S) > 2"vol(V/A). If the map 4S + V/A is 
one-one into, then vol($S) < vol(V/A) which is false. So there are points 
Py, P2 in 48 with the same image in V/A. By symmetry —P» is in 48, so 
that by convexity (Pi — P2) is in 48. Thus P; — P2 is in both S and A. 

If vol(S) = 2"vol(V/A) the same argument shows that (1+ )S contains 
a point Q, ~ 0 of A for any e > 0. But for e < 1 the candidates for Q, lie 
in the bounded discrete set 25M A, so they belong to a finite set. Hence 
there is a point Q # 0 of A which belongs to (1+ €)S for arbitrarily small 
e; and since S is closed, Q must belong to S. QO 


Theorem 10 Let o be the ring of integers of an algebraic number field k 
of discriminant d. 


(i) There is a constant C =C,,,,, depending only on r1,r2 such that each 
ideal class of k contains an integral ideal of Norm at most C\d|}/?. 
(ii) The order of Cy is finite. 


Proof Define a map k + V = R® by sending a in k to 
(O10, 06 py A, ROG, 41, . 6. RO 479, FOp, 41,» » SOP, py) 


where the o, are ordered as at the beginning of this section. If a is any 
non-zero fractional ideal in k, it follows from Lemma 1 and the remarks 
on page 10 that the image of a in V is a lattice. Let S be any closed 
bounded convex subset of V symmetric with respect to the origin; let v be 
the volume of S and M the maximum of 


2 2 2 - 
|zi-°- Gry (2p, 41 + Try trot1) 7 (Crete + t,)| 


in S. The volume of V/a~! is c = 2-72|d|}/?/Norma. For A = 2(c/v)1/ 
the volume of XS is 2c; so by Lemma 10 there exists a 4 0 in a7! NAS, 


3 Embedding in the complex numbers 19 


whence |norma| < A"M. But aa is an integral ideal in the same class as 
a, and 


Norm(aa) = |norma|Norma < 2+"? My q|}/2. 


This proves (i), and since there are only finitely many integral ideals in k 
with Norm less than a preassigned bound, (ii) follows. Oo 


If one is going to use this theorem to compute / (and in general no other 
method is known), then it is desirable to make C,., ,, = 27'+7? M/v as small 
as possible. An efficient choice of S is 

ritra 
jmift---+ltyl+2 D> Va2+22,,, <1 


veritl 


4\" n} 
Crayne = (=) nn 


The details of this calculation are given in the exercise below. Applying (i) 
to the principal ideal class and noting that C,,,,, <1, we obtain 


which gives 


Corollary For every algebraic number field k, |d| > (n"/n!)(47)"? > 1. 


Thus the discriminant of an algebraic number field k with [k : Q] = n grows 
at least exponentially with n. It was shown by Safarevié that no stronger 
statement can be true: see [CF], Chapter IX. 


Exercise Prove that the S chosen above is convex, and that M = n7". 
[For convexity, use the triangle inequality and 


AVzitytt (L—A)V/23+¥8 


> V (Az + (1 — A)z2)? + (Ayr + (1 — A)y2)? 


for 0 < A <1. In polar coordinates this last inequality is 


Ary + (1 = A)ra 2 V/ dr? + 2X(1 — A)rire cos(4, = 62) + (1 a d)?rZ, 


which is trivial. That M < n~” follows from the inequality between the 
arithmetic and geometric means.] 

Transforming pairs z,,2,4,, from Cartesian to polar coordinates, show 
also that v = 271(27)D,, ,.,(n) where 


Dem(t) = f--- f Y1° + Ym Az, ++ dxe dy, ---dym 
Re,m(t) 
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and Rem(t) is given by tp >O0(1< p<), yp 20(1< p<m) and 
Zpt---+2¢4+22%Ay t-::+ ym) <t. 


Prove that 


t t/2 
Dem(t) = I De-im(t — x)dx = Dem-1(t — 2y)y dy 
0 0 


and deduce De m(t) = 4~™t°+?™ /(€ + 2m)! by induction. Oo 


For small values of n, more is known. For r; = 0,r2 = 1 we can take 
Co, } = V3 and this appears to be all that can be said. In all other cases, 
there is known or conjectured to be an isolation theorem. To explain this, 
it will be convenient to write temporarily 


C, = (least Norm of an integral ideal in the class c)/|d,|?/?, 

where c is an ideal class for the field k. For r; = 2,rg = 0 we have 
C-1 > V5, but Cr! > V8 except when c is the unique ideal class in 
Q(V/5); the least point of accumulation of the C7! is 3, and all the classes 
c with Cz} < 3 can be specified. (For proofs of some of these statements, 
see the exercise on page 62.) For r; = 3,r2 = 0 we have Cr 1 > 9 except 
when c is the unique ideal class in the field Q(2cos 2) with d = 49. For 
ry) =1T2 = 1 we have C7! > /23. But the methods by which these results 
are established have nothing to do with algebraic number theory. They 
belong to the Geometry of Numbers, which was once fashionable but which 
(except for the widely applied Lemma 10) ceased to be so some thirty years 
ago in England and considerably earlier elsewhere. 

All these statements are best possible, in the sense that the constants 
cannot be improved. It is conjectured that both the cases r) = rg = 1 
and r; = 0,r2 = 2 have properties like those which have been proved for 
rT, = 2,rq = 0. More spectacularly, it is conjectured that for each pair 
71,72 With r}) +1r2 > 2 the Cr 1 have no finite point of accumulation. 


A unit of o is by definition an element a # 0 of o such that a7! is also 
in 0; thus the units form a group which in standard notation is just o*. 
An alternative and more useful definition is that a unit is any element a 
of o such that norm,;q(a@) = +1. For if a is a unit then norm(a) and 
norm(a~!) are elements of Z whose product is 1; conversely, if a is in o 
with norm(a) = +1 then ta™? is equal to norm(a@)/a, which is a product 
of powers of conjugates of a and therefore an integer. 
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Theorem 11 The group of units in o is the product of the group of roots 
of unity in 0, which is cyclic and finite, and a free group on r1 +r2 —1 
generators. 


Proof The map o* — R™+7? defined by 
ar (log |ojal,... , log jo,,4,,a1) (10) 
is a homomorphism from o* to the hyperplane 
Nitec tY¥,, +2441 +--°+ Yate) =0 (11) 


in R™+72_ Its kernel consists of the a in o* with |ca| = 1 for all o; so in 
the standard topology on k it is a bounded subset of the discrete set 0 and 
is therefore finite. If its order is N then every element of it is an N-th root 
of unity; and the kernel is cyclic because the group of N-th roots of unity is 
cyclic, being generated by exp(2mi/N), and hence so are all its subgroups. 

It remains to prove that the image of the map (10) is a lattice in the 
hyperplane (11). For this, let WV’ be a bounded neighbourhood of the origin 
in (11) or even in R™+7. The points of o* which map into NV have every 
|oa| bounded; so in the standard topology they lie in the intersection of 
o and a bounded set, and hence they form a finite set. Thus the image is 
discrete. It remains to show that the image of (10) spans (11). For this it 
is enough to prove the following assertion:- 


e Given any real Aj,... ,A;,+r, not all equal, there is a unit 7 with 


F(n) =A1 log Join| +--+ + Ap, log |o,, 7] 
+ 2(An +1 log |o,, 4171] nokta Ari +re log lon 4ro7l) #0. 


Let (1,-.. ,Pr;+r2 be positive real numbers such that 
Pi’: * Pry (Pry +1 sie pases): ae (2)" Ia)! = A, (12) 


say, where d is the discriminant of k. In the coordinates defined on page 
18, the set S given by 


|X,| <p, forl <p<ri, [X24 X24 <p; forr) <u <r +72 


is bounded, closed, convex and symmetric with respect to the origin; and 
its volume is 2"!+7?|d|!/2, By Lemma 10 there is a non-zero integer a in k 
such that 


lo,a| <p, forl< peri t+ro; 
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and |norm,/;qa| < A now follows from (12). Since also |norm,;ga| > 1, 


|X,| >A'p, forl<p<n, 


2+ dial 2AM brn <p rita 


Let us say temporarily that a1, a2 in o are equivalent if a1 /az is a unit. 
The elements in an equivalence class are those which generate a particular 
principal ideal, and up to sign their norm is the Norm of that principal 
ideal; so there are only finitely many equivalence classes whose Norms are 
bounded by A, because each of them corresponds to a subgroup of o of 
index at most A. Let (1,... ,8n be representatives of these classes. The a 
generated above lies in one of these classes, so 7 = a/(3, is a unit for some 
v. But now f(n) = f(a) — f(8_) and this differs from 


Ax log pi + +++ + Ap, log pp, + 2(Ani +1 log Pr ta + +++ + Antes log Pritra) 


by at most B = | f(8,)|+ (log A) >> |A,|, which does not depend on the p,. 
We can choose the p, so that (12) holds and the last displayed expression 
exceeds B in absolute value; this ensures f(n) # 0. O 


As it stands, this argument is not constructive because we have no way 
of writing down a complete set of representatives 61,...,8n. But by a 
slight modification of the argument, we can find r1 + r2 — 1 independent 
units; for by choosing different sets of p, we can find as many a with 
|norm,/qa| < A as we wish, and suitable quotients of these will be units. 
Once we have r; +r2—1 independent units, it is in principle straightforward 
to find a base for the group of units; for we can quantify the statement that 
if a is a unit and the image of a under the map (10) is near enough to the 
origin then a is a root of unity. But in practice such calculations can be 
quite tedious. 


Corollary Let S be a finite set, consisting of m primes ino. Then the 
group of units in og is the product of the group of roots of unity in os 
(which is the same as for 0) and a free group onry+r2+m-—1 generators. 


Proof Let p1,...,Pm be the elements of S. There is an exact sequence 
0 — o* — 0% — {free group generated by the p,.}; 
and the image of the right hand map contains each pr and is therefore free 


of rank m. im 


Since the units give rise to a lattice A in the vector space V defined by 
(11) and there is a canonical measure on V, we expect the volume of V/A 
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to be of interest. This volume is called the regulator R of k. To write it 
down explicitly, choose any base m,..-. , 1472-1 for the group 


{units}/{roots of unity} 
and form the (r; + re — 1) x (r1 + r2) matrix whose v-th row is 


(log |oin|,.-- ,log lop, nv], 2log |o,,417],.-- ,2log |on,4-.71)- 


Now delete any column; it does not matter which, since the sum of the 
columns is 0. The regulator R is then defined to be the absolute value of 
the determinant of the resulting matrix. 


4 Change of fields 


Let k C K be algebraic number fields; it is natural to ask how far the 
concepts which we have introduced for extensions k/Q can be extended to 
K/k. For this, one needs to be able to lift certain objects from k to K. 
Elements of k are automatically elements of K, and the property of being 
integral does not depend on the field we are working in; but there is more 
difficulty with ideals. There is only one sensible way of lifting a fractional 
or integral ideal a of k to K; this is by means of the conorm. We define 
2 = conorm,,;,4 to be the smallest (fractional or integral) ideal of K which 
contains all the elements of a; equivalently we could define it as a @, 0. 
It is trivial to check that taking conorms commutes with multiplication of 
ideals, that if a is in k then conormx,;(a) = (a) where the (a) on the left 
is in k and the (a) on the right is in K, and that the tower law 


if K > L > k then conormx;,(conormy,a) = conorm, /,4 


holds for a in k. In other words, the notation is reasonably foolproof. By 
abuse of language, one often writes a where one should write conormx; 4. 
In consequence, we sometimes need to write Norm,a instead of Norma to 
make it clear which field we regard a as belonging to. 

This process gives an apparently attractive way of getting rid of ideals 
and working purely with elements of k: 


Theorem 12 Given any algebraic number field k there is a finite extension 
K Dk such that for every ideal a in k, conormy,,a ts principal. 


Proof Let. a,,a2 be ideals of k in the same ideal class, so that there are 
non-zero (31, 32 in k such that (81)a1 = (G2)a2; then 


(B1)conorm x /,a1 = (42)conorm,/;,42 
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for any K > k. Thus if one ideal in an ideal class of k becomes principal 
in K, ey all do. Now let a1,... , an Ee representatives of the ideal classes 
in k; the a? are all principal in & so ab ae for some a; in k. Let 


K=k(#/ay,... , Yan 
then (conormx/,a:)" = (a@;) and so conorm, ,4; = (#/a4) in K. O 


Unfortunately, this technique for describing divisibility is so clumsy as 
to be unusable. But the theorem has some historical interest, as being one 
of the first signposts towards class field theory. 

If we know that an ideal 2% in AK has the form conorm,,;,a then we can 
recover a as follows. 


Lemma 11 Let K Dk and let a be any fractional ideal ink. Then 
(conormx/,4) Vk = a. 


Proof Since conorm x /,;a D a set-theoretically, all we have to prove is that 
if 8 = )> a, A, is in k, where the a, are in a and the A, are integers in K, 
then @ is in a. Extending K if necessary, we can assume K normal over k. 
Let o1,... ,0n be the elements of Gal(K/k); then 


gs =I] (0,8) = II (= OyIvAy ). 


On the right hand side, the coefficient of any monomial in the a, is an 
algebraic integer in K invariant under Gal(K/k), and hence an integer in 
k. Thus £" is in a”, and so @ is in a. 0 


If 2 is an ideal in K, how should we define norm,,,2? If possible we 
should form the product of 2 and all its conjugates, and transfer the result 
back to k. To do this, we should choose an extension L of K normal 
over k, denote the distinct k-homomorphisms K — LD by o1,...,@n, form 
[] conormy/., «(o,2), which is an ideal in ZL, and show by some analogue 
of the Fundamental Theorem of Galois Theory that this is conorm,/,;4 for 
some ideal a in k. But in this simple form the last step does not work; for 
in contrast with what happens with elements of L, an ideal of Z which is 
invariant under every element of Gal(L/k) need not be the conorm of an 
ideal in k. Suppose for example that k = Q and L = Q(/—5), so that 

= Z[V—5]. The ideal & = (14+ /—5, 1— V—5) consists of all m+n/—5 
with m+ n even; so it is invariant under Gal(Z/Q) and &NQ = (2). But 
 # conormy/aq(2), so % is not a conorm for L/Q. In fact A? = (2) in L. 
The way round this difficulty depends on the following result: 
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Lemma 12 Let a be the ideal in k generated by the normy,;,A where A 
runs through the elements of an ideal XU in K, and let L > K be a normal 
extension of k. Then in the notation above 


conormy, ;,a = [[ conorms jo. x (oi). (13) 
Proof Since o;A is in conormz/¢,«(o:%), normx;,A = [](o;A) is in the 
right hand side of (13). Hence there is an integral ideal £ in LZ such that 


conorm, ;,a = £ II conormy /g, x (ai A). 


We must prove that £ = (1). Let B be an ideal in K prime to £2 K and 
such that 2B is a principal ideal, say AB = (B) with B in K. Since £ is 
fixed under Gal(L/k), conormy,,,«(0i%) is prime to £ for every oj. Since 
B is in A, 


conormy, ;,a D (I 7B) 
= ([] conormz jo,x(2:8)) (I conorm, jo, (ai2)) : 


so [[conorm,/,,« (0:8) is a multiple of £. But []conorm,/,,«(o;%) is 
prime to £. Hence £ = (1). Oo 


In view of this result, we define normy,2 to be the ideal in k generated 
by all the norm,,,A for A in 2. We list the standard properties of norms 
of ideals, which are analogous to those of norms of numbers: 


Lemma 13 Let K > L D k be algebraic number fields, let A,B be ideals 
in K anda an ideal in k, and let A be an element of K. Then 


norm x /,(A) = (norm,;,A), (14) 

norm, /,(norm;;,%) = norm; /,%, (15) 
norm /,(AB) = (normx/~2)(norm;/,B), (16) 

glk) norm x /,(Conorm x/,4), (17) 

(Normga) = norm,qa. (18) 


Proof We can assume that 2%,%,a,A are non-zero, for otherwise these 
equalities are trivial. Now (14) follows immediately from the definition. In 
each of the other equations, the right hand side is contained in the left hand 
side in view of the corresponding equalities for numbers. Now choose ideals 
Q’,B’ in K and a’ in Kk so that 22’, BB’, aa’ are principal, and multiply 
the inequalities for undashed letters by the corresponding inequalities for 
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dashed ones. We obtain statements about numbers, in which equality holds 
by (14); so equality must also hold in the component inequalities. Oo 


Corollary Let K D k be algebraic number fields and let p be a prime ink 
whose factorization in K is conormx,,p = [] P;*- Denote by f, the degree 
of O/B; over o/p. Then norm x /. Bi = pf and [Kk 7 k] = Sefi- 


Proof Taking Norms and using the last two results in the lemma, 


(Norm, (conormy,P)) = normxq(conorm x xP) 
= norm [K:k} _ N [K:k] 
= k/Qp) = (Normep) 


and 
(Norm, Il #*) = [[(Worme Bi)" a [[ Normep)**. 


Comparing these results gives the second assertion. Hence norm, /,P; is a 
power of p by (17), and normy/q(normx /,Bi) = normy;ap* by (15), (18) 
and Normx }; = (Norm,p)#; this gives the first assertion. O 


By analogy, we can define the relative discriminant of K over k as 
follows. Let [K : k] = n and let ay,... ,a, be n elements of Dx linearly 
independent over k. Imitating (1), we write 


Nica (1, +++ Qn) = det(Trx 4 (ap,av)); 


then the relative discriminant of Ox over o is the ideal in k generated 
by all the AZ, / ,(@1,--.,@n). But this is less interesting than the relative 
different introduced in §8. 


5 Normal extensions 


Throughout this section K, k will be algebraic number fields with K normal 
over k, so that G = Gal(K/k) actson K and everything derived from it. Let 
p be a prime ideal in k, and let conormx,p = [] $f" be its factorization in 
K and f; the degree of the field Ox /$; over o;/p. In an obvious notation 
we have the tower laws ex/q = €x/kek/q 2nd fxjq = fx/kfe/q- Ifo is in 
G then also p = [](o;)%. Since each of; is a prime ideal in K, the of; 
must be a permutation of the %;. The key to the results of this section is 
as follows. 


Theorem 13 Suppose K is normal over k with G = Gal(K/k). Then G 
acts transitively on the T;, all the e; have the same value e, all the f; have 
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the same value f, and (G] = [K : k] = efg where g is the number of distinct 
prime factors of p in K. 


Proof Let % be one of the 8;; then $8NK is an ideal in o, containing p but 
not containing 1; it must therefore be equal to p because p is maximal. By 
Corollary 1 to Theorem 7, we can choose a in $ so that (a)/$ is prime 
to p; thus norm, ,a@ lies in BN = p, so that each P, divides some oa. 
But the ideal (ca) is the product of o8 and an ideal prime to p; hence 
; = of. Since [K : k] = efg is now a special case of the Corollary to 
Lemma 13, all the other claims follow at once. oO 


In what follows, we fix our attention on one prime factor $ of p. Let 
Z denote the subgroup of G consisting of those o which fix ; Z is called 
the splitting group (Zerlegungsgruppe) of $ and its fixed field Kz the 
splitting field. The reason for this name comes from 


Lemma 14 Let H be a subgroup of G and let Ky be its fixed field. Then 
, and PB; divide the same prime ideal in Ky if and only if P; = oP; for 
some o in H. 


Proof ‘If? is clearly trivial; for ‘only if? we apply the transitivity property 
of Theorem 13 to the normal extension K/Ky with Galois group H. O 


It follows that Kz is the smallest field ZL between k and K such that 
the prime $M L in L does not split in K, though it may ramify there. 
Moreover if e, f,g refer to % or a prime in a smaller field divisible by it, 
then g(K/Kz) = 1 and [Kz : k] = [G: Z] = 9(K/k), so that 


e(K/Kz)f(K/Kz) = [K : Kz] = e(K/k) f(K/k). 


By the tower laws e(K/Kz) < e(K/k) and f(K/Kz)-< f(K/k); so we 
have equality in both relations. Thus in going from k to Kz we split off 
the prime 8 9 Kz but do not ramify it or extend its residue field. 

Now consider the residue field Dx /%. Every element of the splitting 
group Z induces an automorphism of 0% /8 which leaves o/p elementwise 
fixed. Moreover, if a is in K then the characteristic polynomial of a over 
Kz is o(X) = [], in g(X —oa). Denote reduction mod f by a tilde; since 
we have just shown that #(X) is defined over o/p and all its roots are 7@ 
with o in Z, all the conjugates of @ over o/p have this form. Choosing a 
so that & generates Dx /%8 we deduce that every automorphism of Ox /P 
over o/p is induced by some o in Z. Let T be the kernel of the epimorphism 


Z — Gal((O0x/P)/(0/p)), 
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where we know from the Galois theory of finite fields that the right 
hand side is a cyclic group of order f; T is called the inertia group 
(Tragheitsgruppe) and its fixed field Ky the inertia field of B,. This 
proves all but the last clause of 


Theorem 14 T is a normal subgroup of Z, of order e, and Z/T is cyclic 
of order f. T consists of those elements of Z which induce the identity on 
O/B; these are just the elements of G for which a = camod for all a 
in On. 


For the last clause, we need only note that if o is not in Z then we can 
choose a so that a is in $8 but not in o~!$, whence afca mod}. Now 
Gal(K/Kr) = T, which is also the inertia group of $ for the extension 
K/Kr; thus f(K/Kr) = 1, whence e(K/Kr) =e and so PN Kr = f°. 
Thus %M Kz remains prime in Ky, which is why Kr is called the inertia 
field, but going from Kz to Kr induces an extension of degree f of the 
residue field. 

We can go further. Choose II in Ox so that || and consider those 7 
in T for which oll = mod 8?. In view of the last clause of Theorem 14 
this property does not depend on the choice of H, and hence such o form 
a group V; it is called the ramification group (Verzweigungsgruppe). 


Theorem 15 V is normal in Z, and is the unique Sylow p-subgroup of T; 
moreover T'/V is cyclic and its order divides Norm(B) — 1. 


Proof If p is the rational prime underlying P then any element of V has 
order a power of p. For if o is an element of V other than the identity then 
we can choose II so that oll A Il. Thus oll = 1 + all™ mod P+? for 
some m > 1 and some a in Ox not divisible by P. Iterating, we obtain 
ol = M+ rall™ mod $+, Thus o cannot have order prime to p, and 
the same happens for any power of o other than the identity. Now let o be 
any element of T and write 6, = of/H; then 8, does not depend on the 
choice of I and o — £8, is a homomorphism from T into the cyclic group 
(Dx%/9)* with kernel V. This implies in particular that V is normal in T. 
But since (0 « /$)* has order prime to p, V is p-Sylow. Uniqueness follows 
from the facts that all Sylow p-subgroups of an arbitrary finite group are 
conjugate and V is normal in 7. That V is normal in Z follows from the 
facts that T’ is normal in Z and V is uniquely determined by T. O 


It is possible to investigate the structure of V further, by defining the 
higher ramification groups. For these, and their connection with the differ- 
ent (introduced in §8), see Chapter IV of [Se]. It turns out that a knowledge 
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of the chain of ramification groups associated with $B tells one the exact 
power of $8 which divides the different. This both strengthens Theorem 21 
in Chapter 2 and proves the assertion made in §8 that the different is a 
measure of the badness of bad primes. (The assertion is still valid for non- 
normal extensions, but the proof requires more complicated notation.) Of 
course, once one has obtained Z the rest of this section is really local theory 
and might be better expressed in the language of Chapter 2; for in that 
language Kk, = Kz within Kg and hence Z is canonically isomorphic 
to Gal( Kg /kp). 


If e = 1 then T is trivial; and Z = Gal((O0%/$)/(o/p)), which is now 
cyclic of order f, has a natural generator given by &@ H GNo™P for all &. 
The corresponding element of G is called the Frobenius element and is 
denoted by (I. It is uniquely determined as an element of G by 


Fe] a = aX? mod (19) 


for all a in Ox. It obviously has the property 


=P 
of By 

for every o in G; thus it is defined up to conjugacy by a knowledge of p. In 
particular, if G is abelian then this symbol depends only on p; in this case 
it is called the Artin element or Artin symbol and is denoted by (AL). 
(Note the change from square to round brackets.) By multiplication we 
can now define the Artin symbol (#4) for any fractional ideal a which 
involves no ramified prime; and by construction the map a ++ (42) isa 
homomorphism. The Artin symbol plays a central role in class field theory. 

In some contexts one needs to extend the definition of the Frobenius and 
Artin elements to the case of primes ramified in K/k. There is only one 
sensible way to do this; that is to define (FI to be the set of all elements 
of G which satisfy (19). This identifies the Frobenius element as a member 
of Z/T, or equivalently as a left coset of T in Z or even in G. 

It is natural to ask how all these objects behave under change of field. 
For this purpose we suppose that K D L D k, that K is normal over k with 
G = Gal(K/k), that H = Gal(K/L) and that is a prime ideal of K and 
Q, p are the corresponding prime ideals of L,k. If we consider L/k we shall 
also need to assume that H is normal in G, so that Gal(L/k) = G/H. 


Theorem 16 With the notation above, ZK ;, = ZK;,H and similarly 
forT and V; and (A) = [Ayr where NormQ = (Normp)", so that f’ 
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is associated with Q as a factor of p. If H is normal inG then Z,/, ts the 
image of Zx;~ in G/H and similarly for T and V; and [S) is the image 
of (5). 


Proof The first half of the first sentence is immediate, and the second 
follows from (19) because [XA " is the only element of G with the requisite 
property. Now assume that H is normal in G, and let o be an element of 
G. The image of o in G/H is in Z,,, if and only if of divides 0; and 
since H is transitive on the prime ideals in K which divide Q, this happens 
if and only if oH meets Zx/,. A similar argument works for T, and the 
result for V follows from the fact that V is the unique Sylow p-subgroup 
of T. The result for the Frobenius symbol follows trivially from (19). O 


Suppose that we consider another prime factor of of p instead of ; 
then the Z,T,V for of are obtained from those for $ by conjugation by 
o. In an obvious notation, Zon = oZgo—' and so on. We have already 
noted that the corresponding statements hold for the Frobenius and Artin 
elements. 
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Valuations 


6 Valuations and completions 


For any field k, an absolute value, also called a multiplicative valua- 
tion, will be a map k —+ R denoted by a+ |la|| which for some a > 0 
satisfies the conditions ||a|| > 0 for a 4 0, |]0|| = 0 and 


llell - 18l] = lle], la + BIl* < |lall* + [1I!*. (20) 


Readers are warned that some authors use a more restrictive definition. 
Two valuations |J.||; and ||.||2 are called equivalent if |lal|2 = |lal|f for 
some fixed c > 0 and all a. An equivalence class of valuations is called a 
place. Any valuation makes k into a metric space with metric given by 
d(x1,22) = ||z1 — £2||*; this metric depends on a, but the induced topology 
only depends on the place. 


Lemma 15 Distinct places induce distinct topologies on k. 
Proof Suppose that ||.||1 and ||.||2 induce the same topology on k. Since 
zl] <1 <=> |l2z"|| -0asn-00 = 2" -+0asn > 00, 


l[zl|1 <1 if and only if ||z\lz < 1. We can assume that there exists r9 # 0 
with this property, for otherwise ||z||,1 = ||z|l2 = 1 for all 2 #4 0. Define 
ce > 0 by ||zoll2 = ||zoll{ and for any x # 0 in k with ||z||1 < 1 let A satisfy 


zll1 = llzoll$. If m/n > A then |[z7/z"||1 < 1 whence |[z7*/z"l/2 < 1 
and ||z\l2 > ||zollz'/"; similarly if m/n < d then ||z\l2 < {\zoll%/". ‘Thus 
lIzll2 = [|20ll3 = llzollf* = [lzilf. O 


At this stage it must appear more natural to get rid of a and replace the 
second relation (20) by the usual triangle inequality 


lla + All < \lal] + 11411. (21) 
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However on page 35 we shall see that in each place there is a natural 
distinguished element (called a normalized valuation), and unfortunately 
it does not always satisfy (21). 

In what follows we shall ignore the trivial valuation |{a|| = 1 for all a 4 0, 
which corresponds to the discrete topology. A multiplicative valuation on 
a field k is called Archimedean if k has characteristic 0 and ||ml| > 1 
for some m in Z, and non-Archimedean otherwise; the excuse for this 
terminology is that for an Archimedean valuation ||m|| tends to infinity 
with m, and Archimedes wrote a book called On Large Numbers. It turns 
out that Archimedean and non-Archimedean valuations have significantly 
different properties. 


Lemma 16 If k is an algebraic number field, the Archimedean valuations 
on k are given by |la|| = |oa|* where c > 0 ando is any embedding k + C. 


Proof The function on k defined in this way is certainly an Archimedean 
valuation, so let ||.|| be any Archimedean valuation. It is clearly enough to 
prove the result when a is in 0. Replacing ||.|| by an equivalent valuation 
if necessary, we can assume that (21) holds. The first task is to prove 


© ||ml| = |m|° for all m in Z and some fixed c > 0. 


Clearly {|+1||? = ||1|] = 1, so we need only consider the case m > 1. Choose 
some mg > 1 and for any m, N write m% in the scale of mo: 


mN = So avme where 0 < a, < mp. (22) 


Here the sum is taken over 0 < vy < Nlogm/logmp. Let A be an upper 
bound for |/a|| where 0 < a < mp. If we could choose mo with ||moll < 1, 
applying (21) to (22) would give ||m||N < A(1+ Nlogm/log mo); taking 
N-th roots and letting N — oo would then give ||m|| < 1 for all m > 1, 
contrary to hypothesis. Thus ||m|| > 1 for all m > 1. But now the same 
argument gives 


I|m| < A(1 + N log m/ log mo)||mo||% °&™/ os mo ; 


taking N-th roots and letting N — oo we obtain ||ml| < ||mp||!@™/!8™, 
If we had strict inequality here, we could interchange the roles of mp and 
m and obtain a contradiction. So |}m||!/'8™ is independent of m, and if 
we choose c so that it is equal to e° our assertion follows. 

Now let a be a non-zero element of 0, and order the a, : k > C so that 
lola] > |ov410|. (This ordering depends on a@ and should not be confused 
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with the ordering used in §3.) Let c have the value obtained in the previous 
paragraph. Write 


v=m 
[[(X - 0%) = xP 4 a:X™ 14. 4an, Pa = [] (ve). (23) 
o v=1 


The am are symmetric functions of the ca% and the largest summand in 
Om is +Pm; so |am| < C|Pm| where C depends only on n. Moreover, if 
|om+10| < |oma| then once JN is large enough this summand is much larger 
than any other in am; 80 |am| > $|Pm|. Also ||am|| = |am|° because the am 
are in Z. If 


lo,a|° > |lal] > Jousiel° 


for some pz, then in the first equation (23) with a for X the term a,aN(-#) 
on the right would be much larger than any of the others, which contradicts 
the triangle inequality since the sum of all the terms vanishes. We get a sim- 
ilar contradiction if ||a|| > |o1a\*° or ||al| < |ona|*; so ||a|| = |o,a|° for some 
p. Apparently y here might depend on a; but using ||a@ || = |lal] - ||G||% 
with N large we find first that we can require p to be the same for 6 and 
af" and then that it is the same for aB% and a. oO 


A refinement of this argument actually proves that any Archimedean 
valuation on any field L is equivalent to one defined by ||z|| = |oz| for some 
embedding o : L — C. In our case there are r) +12 classes of Archimedean 
valuations — one for each real embedding k —+ R and one for each pair of 
complex conjugate embeddings k — C. Note that R or C respectively can 
be identified with the completion of k under the appropriate topology. The 
Archimedean places are often called the infinite places, or by an abuse 
of language motivated by Lemma 17 below, the infinite primes. 

For non-Archimedean valuations we can radically improve (21), for if ||.|| 
satisfies (21) and is non-Archimedean then 


N N 
a N- N= 
lla + BIN =" oanwaN "BI < D7 llan.wil la" 9" I. 

Here the an, are integers, so their absolute values are at most 1. The 
right hand side is bounded by (N + 1) max(|lall, |||)". Taking N-th roots 
and letting N — oo gives 

\|a + Al] < max(|lal], ||5]I); (24) 
and if this holds for one valuation in an equivalence class, it holds for them 


all. Applying (24) to a = (a + 8) + (—) gives 
lal] < max(||o + 4], |||); 
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and by symmetry there is equality in (24) whenever ||| # ||G||. A metric 
having this property is called an ultrametric. 


Lemma 17 Let ||.|| be a non-Archimedean valuation. There are a non-zero 
prime ideal p of 0 and a constant C > 1 such that |la|| = C~™ for alla 40 
in k, where m is defined by p™||a. 


Proof Any a # 0 in o satisfies an equation 
a™ +4,077!4+---+an =0 


where the a, are in Z. If |ja|| > 1 then the first term on the left would have 
strictly larger valuation than any of the others, which contradicts (24). So 
lal] < 1 for all @ in o. If |la|| = 1 for all a A 0 in & then our valuation 
would be trivial; so there are integers a 4 0 with |lal| < 1. Using (24) 
again, the set of a with ||a|| < 1 form an ideal p; and p is prime because 
|Ja1ag|| < 1 implies |]a1|| <1 or lfagl| < 1. 

Now choose 7 in p but not in p? and let a be any non-zero element of k. 
In the notation of the lemma we can write (a/m™) = a;/a2 where ai, ag are 
integral ideals prime to p. By Theorem 7 we can find {2 in ag and prime 
to p. Write 6, = B2a/m™, so that (, is in a1. Neither (1 nor (2 is in p, so 
they both have valuation 1; thus ||a|] = {7||™. Oo 


Conversely, by the formula in the lemma any p and C determine a non- 
Archimedean valuation; and changing C only changes the valuation within 
its place. This place can be identified with p, and will be called a finite 
place. Denote by ky the completion of k under the metric associated with 
p. An element a of ky is determined by a Cauchy sequence {a,} where 
[nm — Qn|| + 0 as m,n — oo; hence ||a,|| tends to a limit, which we can 
define to be ||a/|. It is easy to check that k, is a field containing k, and that 
||.|| determines a non-Archimedean valuation on ky which extends the given 
valuation on k. If we denote by o, the set of a in ky such that |la|| < 1 and 
by pp the set of a in kp such that |la|| < 1, then o, is an integral domain 
whose quotient field is k, and py is its only non-zero prime ideal. Note 
that op/p, can be canonically identified with o/p. The elements of o, are 
in one-one correspondence with the nested sequences of residue classes 


a; modp > agmodp? > --- D> ammodp™ D---. 


Now let the ,, be a fixed set of representatives of the residue classes mod p 
in 0, and let 7 be in p but not in p?. (When we work with k, we shall 
consistently use 7 in this sense.) Thus the 1”~1,, are a set of represen- 
tatives of the residue classes mod p” in p*~1; hence the elements of 0, are 
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the )°0° Yn7” where each 7, is a G,. Similarly the elements of ky are the 
Ny Wnt”. 

The field k, is an example of a local field, a subject for which there is a 
good introduction in [Ca] and a comprehensive account in (Se]. Analysis in 
kp is much easier than in R; for example, it follows from (24) that a series 
in ky converges if and only if its terms tend to 0. The next three lemmas 
are further examples of this; they are stated only for k,, but it will be clear 
from the proofs that they hold in much greater generality. 

If we wish to refer to a place which is not necessarily finite, we usually 
denote it by v and denote the corresponding completion of k by ky. If v is 
infinite, it is conventional to take 0, to be the same as k,. 

The formulae for ||.|| in Lemmas 16 and 17 each contain an arbitrary 
constant. But there is a canonical way of deriving a valuation on k, in the 
class of ||.|| from the induced topology on k. For any a in k,, multiplication 
by @ gives a map of k, to itself which multiplies the natural measure of 
volume on k, by a constant; we choose this constant to be |la||. (The 
volume on k, is only determined up to an arbitrary constant, but this does 
not affect the value of |{a/|.) This process is also described, from a slightly 
different starting point, in §A1.3. For infinite places this gives ||a|| = |ca| 
if o is real and ||a|| = |oa|? otherwise; for the finite place associated with p 
it gives ||a|| = (Norm,p)~™ where p™ is the exact power of p which divides 
(a). This and Norm((a)) = |norm,/Qqa| immediately give the Product 


Formula 
[[ lel. =1 
v 


where the product is taken over all normalized valuations. 

Taking the logarithm of a non-Archimedean valuation and renormalizing, 
we obtain the additive valuation a ++ vp(a) = v where p” is the exact 
power of p which divides a. (The double use of v is traditional, and should 
not cause confusion.) The corresponding rules for this are 


Up(@B) = Up(a) + p(B), Up(a+ 8) > min(vp(a), vp(f)), 


and in the latter rule we have equality whenever the two arguments on the 
right are unequal. 

Lemma 18 below says that under modest conditions factorizations of 
polynomials can be lifted from o/p to o9. This and Lemma 19 are special 
cases of Hensel’s Lemma, for which the underlying idea is due to Newton. 
In its most general form (which belongs to algebraic geometry), this enables 
one to refine an approximate solution (in k or k,) of a set of polynomial 
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equations in k or ky to an exact solution in k,. In Lemma 18 we shall again 
use a tilde to denote reduction mod p. 


Lemma 18 Let f(X) be a polynomial in op[X] and f(X) = $1(X)¢2(X) 
where $1, @2 in (o/p)[X] are coprime. Then there exist polynomials fi, fa 
in op[X] with f(X) = fi(X) fo(X), deg fr = deg ¢: and f,(X) = 6.(X). 


Proof We construct polynomials f{”, (a) in op[X] for n = 1,2,... whose 
reductions mod p are @1, ¢2 and silitch have the propertiesdeg fo = deg ¢1, 
deg f< < deg f — deg ¢1, 


p'IF- AM AD) and p'i(ffrtD - f£) for v= 1,2; (28) 
then the fr = lim fy, {") will exist and have the required properties. 


For the fi? we lift the ¢, to op[X] in any way. To construct the f, 
from the A? we proceed as follows. By hypothesis 


(n+}) 


f= ff 427A for some h() in 0,[X] with deg h™ < deg f. 


If we choose f{"*)) = ¢{” +g") with the g\” in op[X] then the second 
condition (25) will certainly be satisfied, and the first one will be equivalent 
to 


A”) = Ag” + fy” g{") modp, 
and therefore also to 


AP = big + dag 


Since $1,¢2 are coprime and (o/p)[X] is a principal ideal domain, there 


are polynomials 71,2 in (o/p)[X] such that dive + dav = h(™ and 
deg 1, < deg ¢1; and we can take the g to be any lifts of the w,. QO 


The next result is not quite so obvious as one would expect. 


Corollary Let K,k be algebraic number fields with K > k and let B,p 
be prime ideals in K,k respectively such that Blp. If a is in Og then a is 
integral over Oy; in particular Trim /kp & and NOIM Ky /k,% ATE iN Oy. 


Proof Suppose that P°||p and choose II in O so that $||IT. Let By,...,Bn 
be a base for 0/8 as an (o/p)-vector space. The representation on page 
35 implies that the N+B, with 0 < yp < e form a base for Og as an op- 
module. Hence Kg is algebraic over ky. In what follows, we shall use the 
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absolute value associated with $8, which clearly induces an absolute value 
on k associated with p. Let 


f(X) =coX™ +.X™ 7) 4+---+m (co =1) 


be the minimal monic polynomial for a over ky. We assume that the c,, 
are not all in op and obtain a contradiction. Let b in o, be such that the 
be, are all in o, but not all divisible by p. If bc, were the only one of the 
be, not in p then c,, would have strictly larger absolute value than any of 
the other terms in f(a) = 0, contradicting the ultrametric law (24). In any 
other case, we can use the lemma to lift the factorization bf(X )-ltoa 
non-trivial factorization of bf (X) over op, and f would not be minimal. 0 


It follows from Lemma 18 that if f(X) = 0 has a root of multiplicity 1 
in o/p then this root can be lifted to a root of f(X) = 0 in op. Here the 
multiplicity 1 condition is inconvenient, though we have-to pay a price for 
dropping it. 


Lemma 19 Let f(X) be a monic polynomial in 0,[X] with formal deriva- 
tive f'(X), and let a in oy be such that || f(a)|| < ||f’(a)||?. Then there is 
a unique root a* of f(X) =0 in oy such that 


Ila* — al] < HACIA FDI < I F()I- (26) 


Proof We construct a sequence a, = @,@2,... of elements of o, such that 


F(ansi)ll < {IF(@DI/IF (IP SI F(an)I < IF (an)Il, (27) 
f'(onsa)l = IF (@)I,  lonta — all < F(on)I/IF (DI. 
These relations imply that |lan41 — Q@n|] — 0, so that the sequence tends 


to a limit a* which is clearly a root of f(X) = 0. 
If On41 = On + Bn for some ZB, in oy then 


f(Qns1) = f(an) + Baf'(Qn) + integral multiple of 6?. (28) 
Take Bp = —f(an)/f'(@n), so that the first two terms on the right of (28) 
cancel. Now the penultimate claim in (27) follows from 
f'(O@n41) = f'(@n) + integral multiple of 6, 


because the second term on the right has strictly smaller value than the 
first; and then the remaining claims follow at once. 

It remains to prove uniqueness. But suppose aj, a5 are distinct roots of 
f(X) = 0 satisfying (26); then f(X) = (X — af)(X — a})9(X) with g(X) 
in op[X], whence |} f’(aj)|| = llaj — gl - lg(@7)Il < lloy — a3ll. 0 
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Some f are such that we can start this successive approximation process 
with a worse estimate a. An example which will be needed later is as 
follows. 


Lemma 20 Let p be the rational prime below p and let ao,€ in 05 be such 
that of = Emodp™'" where p™||p and r(p—1) > m. Then there exists a 


in-o5 such that a? = €. 


Proof We again construct a sequence a,@1,... such that 
€=a%,,modp™*t™t!, any) = Op mod p™*"; (29) 


then @ = lima, will do what we want. Suppose that we have already 
chosen Q,...,Qn, and write Qn4; = @, +7"*"G, where Bp, must be in 
Op. The first congruence (29) will be satisfied if 


é _ aP = pr®*gP-18, mod perrentt 


and this is always possible. Oo 


Lemma 21 (i) Let L = kp(q) be a finite algebraic extension of kp and let 
f(X) be the minimal monic polynomial for a over kp. If 9(X) in kp[X] 
is monic and close enough to f(X), then there exists B in L such that 
9(B) = 0 and L = kp(B), and g(X) is irreducible over ky. 

(ii) Every finite algebraic extension of ky lies in some Ky where K is a 
finite algebraic extension of k and 8 is a prime of K above p. 


Proof To prove (i), we would like to apply Lemma 19 to a as an approxi- 
mate solution of g(X) = 0; but this is illegitimate because we do not yet 
have a valuation on ky(a). Instead we replace f(X) by a polynomial go(X) 
close to f(X) and in p[X]; and if @ is a root of gq we choose a neighbour- 
hood WN of f in the space of all monic polynomials in k,[X] with the same 
degree as f so that the conditions of Lemma 19 hold for 6 and any g in NV. 
For k,(@) to be meaningful we must embed k, and @ in a common field; 
it will be convenient to take this field to be Ky, where K is the splitting 
field of go(X) over k and $B is any prime of K lying above p. Clearly the 
valuation on K associated with 8 restricts to a valuation on k associated 
with p, though the latter may not be canonically normalized; thus we can 
use ||.|| to denote both of them. 

After multiplying a by a suitable element of 0, we can assume that f(X) 
is in op[X]. Define D(91, 92) = [] 9} (6i) for any monic polynomials gy, 92 
in ky[X] and of the same degree as f, where the product is taken over all 
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roots £; of gg; thus D(g1, 92) is equal to a polynomial in the coefficients of 
gi and go, and c = ||D(f, f)|| 4 0 because f has no repeated roots. We 
require N to be so small that ||D(g1, 92)|| = ¢ for all 91,92 in N and that 
every g in N is in o,[X]. We impose the further condition on N that for 
g in N every coefficient of f — g has absolute value strictly less than c?. 
If we take gg to be in NV 0[X] then every coefficient of g — go admits the 
same bound and hence 


II9(2)Il = Ilg(@) — g0(8)|I < ¢?. 


If @; is any root of go, then g’((;) is in Og; so |{g’(8;)|| < 1 and therefore 
\l9’(B)|| 2 c. Hence the hypotheses of Lemma 19 are satisfied, so to each 
root (3; of go there corresponds a root 4; of g in Og with ||; — G;|| < c. But 
if G:, 8; are distinct roots of go then ||; — ;|| > |l9o(G)|| > c so if also 
ly; — B;|| < c then y% and 7; are distinct. Since g has only as many roots 
as go, each root of g must occur as a 7; and is therefore in Om. Moreover 
kp (vi) C kp(Gi) C Kp for some root ; of go. 

Taking g to be f, this proves (ii). Knowing this, we can take go in the 
argument above to be any polynomial in V. But kp(@) > kp(a), so that in 


deg 9 > [kp(@) : kp] > [kp(a) = kp] = deg f 


the two outer terms are equal; it follows that k,(@) = kp(a) and g(X) is 
irreducible over ky. O 


In the language which we have introduced in this section, we can restate 
the Chinese Remainder Theorem (Theorem 7) as follows. Let pi,..., Pm 
be distinct prime ideals in o and a;,...,Qm any elements of 0; for any 
€ > 0 we can find a in o such that ||a — ayllp, < € for each yp. A variant 
of this which does not discriminate against the infinite places is the Weak 
Approximation Theorem: : 


Theorem 17 Let v1,...,Um be distinct places of k and a,...,Qm any 
elements of k. To any € > 0 we can find a ink such that ||a — ayllu, < € 
for each p. 


Proof We saw on page 3 that k is dense in [| k,, where the product is over 
all infinite places; so we can find @ in k satisfying ||G — ay||v, < $e for all 
the infinite places among the v,. Choose M > 0 in Z such that M(a, — A) 
is in o for each finite place v,. By Theorem 7 we can find y in o such that 


lly ~ M(y — B)llv, < 5€l|Mllv, 
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for each finite place v,; then a = 7/MN + G will satisfy all our conditions 
provided that N > 0 in Z satisfies 


|Nllv,, > 2e7*||y/Mllv, for each infinite place v,, (30) 
N= lv, < del] M/llo, for each finite place v,. (31) 


For this, it is enough to choose N so that N —1 is divisible by a large power 
of Normp, for every p, associated with a finite place v,. O 


Exercise In the notation of the theorem, prove by induction on m that 
there exists @ in k such that ||||,, > 1 and ||Glly, <1 for p = 2,...,m. 
[Use the proof of Lemma 15 to find y with ||y||v, > 1 and |lyIIv,, < 1. By 
the induction hypothesis there exists 6 with |||], > 1 and |[d|., <1 for 
p = 2,...,m—1. Now take GB = 46" if |[dl|u,, < 1 or B = y6"/(1 4 8") 
if {d|ly,, > 1, where r is large.] By considering 6"/(1+ 8") for r large, 
deduce the theorem in the special case a; = 1, ag = --- = Gm = 0 and 
hence derive it in general. QO 


The significance of the phrase ‘weak approximation’ will be explained 
in 89, where Theorem 17 will be translated into the language of adéles. 
Roughly speaking, a weak approximation theorem asserts the existence of 
one or more elements of k satisfying certain conditions, whereas a strong 
approximation theorem asserts the existence of one or more elements of 
0, or os for a preassigned finite set S of prime ideals, satisfying certain 
conditions. (Thus Theorem 7 is a strong approximation theorem.) Strong 
approximation theorems are in general much harder to prove than weak 
approximation ones, and are usually uglier to state. For example, the 
strong analogue of Theorem 17 is trivially false for o. If we add the further 
condition that there is at least one infinite place which is not among the 
Uy, we obtain a plausible but unproved conjecture. 


7 Field extensions and ramification 
Let K = k(q) be an algebraic number field with [K : k] =n, let ¢(X) be 
the minimal monic polynomial for a over k, let p be a prime ideal in o, 
and let p be the rational prime divisible by p. There is a close relationship 
between the factorization of @ in ky and the factorization of conorm, /,p 
in Ox. The latter must have the form 


conormx/;p = I] ve (32) 
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for some primes f,,... , PB, in Ox. We say that p ramifies in K/k if some 
€, > 1; the ramification is called wild if some e,, is divisible by p, and tame 
otherwise. If we write norm, By = pf«, the Corollary to Lemma 13 gives 


n=[K:k]= Seen fu: (33) 


We temporarily drop the subscript yp. Let (1,..., 6s in Ox be such that 
their images are a base for 0/8 as an (o/p)-vector space, and let IT in Dx 
be such that ||I]. Then the 6;I? with 0 < j < e form a base for Ox / as 
an (o/p)-vector space, so that [Ks : kp] = ef. We can however strengthen 
this result considerably: 


Theorem 18 In the notation above, there is a natural isomorphism 
K @ kp = Ky, 8--- OKg, (34) 
both algebraically and topologically. 


Proof By (33), both sides of (34) are kp-vector spaces of the same finite 
dimension. There are natural maps K ®; kp — Kg, and hence there is a 
continuous vector space homomorphism ¢ from the left hand side of (34) to 
the right hand side. But ¢(K) is dense in the right hand side, by Theorem 
17, so ¢ is onto. 0 


Theorem 19 In the notation above, let ¢(X) = [] dy(X) where the $,(X) 
are irreducible in ky[X]. Then after renumbering, deg gy, = eufy and 
by(X) is the minimal monic polynomial for a over Kg, . 


Proof For each fixed yp, it follows from Theorem 18 that a generates Ky, 
over ky. Let ¢,(X) be the minimal monic polynomial for a over ky when 
both are considered as lying in Kp,. (This will depend on yp; the point 
is that we cannot define binary operations between a and elements of ky 
until we have embedded them both in a common field.) Thus ¢,(X) has 
degree [Kq : kp] = ey f, and divides ¢(X); comparing degrees and using 
(33), it only remains to show that the ¢,(X) are distinct. But the topology 
on Ky, is determined by a knowledge of ¢,,(X), and by Theorem 17 the 
topologies induced on K by the various $8, are all different. O 


We shall shortly see that when k = Q the p with some e > 1 are just 
those which divide dx; hence for any k the primes p in o which ramify 
in K/k also divide dx. Assume for convenience that we have chosen a to 
be an integer. In the notation above ¢(X) is in o[X] and if one excludes 
finitely many p then ¢(X) = 0 has no repeated roots. If so, by Lemma 
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18 the factorization of ¢(X) over k, corresponds to that of o(X ) over Fy 
where g = Normp; and if ¢:(X) corresponds to $8; then we can take 
1 = (p,vi(a)) where ¥i(X) is any lift of $1(X) to o[X]. We can actually 
do better than this. 


Lemma 22 With the notation above, let a in D be such that the index of 
ola] in O is finite and prime to p. Then O(X) = T](vu(X)) where the 
wu(X) are in Fy[X], coprime and irreducible, and deg yy, = fu- If 9u(X) 
is any lift of py(X) to o[X] then PB, = (p, g,(a)). 


Proof Let y(X) be a monic irreducible factor of $(X) over Fy and let # in 
F, be a root of ~(X). Every element of © can be written as 707’ c,a” 
where the c, are in k with denominators prime to p and n = [K : k]; 
hence there is an epimorphism 0 — F,[(] given by a ++ @ which extends 
reduction mod p. Let 8 be its kernel; since the image is a field, 8 is a prime 
ideal which divides p. If e, f denote the standard values corresponding to 
$8 then degy, = f because F,[G] ~ 0/% has order (Normp)/. Moreover 
8 > (p,9:(a)). Conversely any element of 8 has the form h(a) where 
h(X) is in k[X] and the coefficients of h have denominators prime to p; and 
h(8) = 0 so that y, divides h. In other words h(X) — gi(X)h1(X) has all 
its coefficients in p for some hy(X) in o[X]; setting X = a we deduce that 
y is in (p, 91(@)). In particular P\|g:(a@) if e > 1. 

We have ¢(X) = [](vy(X)) for some cy. Thus $(X) — [](gu(X))™ 
has all its coefficients in p, so [](g,(a)) lies in p. On the other hand 
9u(a) maps to w,(8) 4 0 if w # 1, so g,,(@) is not in PB and the distinct 9, 
correspond to distinct prime factors of p in K. Hence (gi(a@))* is divisible 
by B°; so c; 2 e if e > 1, by the last result in the previous paragraph. 
This also holds if e = 1. Comparing [K : k] = }\ cy f, with (33) shows that 
c, = e and that each 8, is generated in this way. O 


The most favourable situation is when DO = o[a], which seems to happen 
quite often. (There is a problem with assertions like this, because it is 
not clear what one should mean by an arbitrary algebraic number field.) 
But there are pairs K,p for which there does not exist an ao satisfying the 
conditions of Lemma 22, though this can only happen if Normp < [K : k]. 
(See Exercise 2.8.) In such cases there may still be labour-saving tricks 
available, but sometimes the least onerous way to find how p factorizes is 
to study D/p. However, the following lemma, which is needed for a quite 
different reason, does enable one to modify Lemma 22 so as to compute 
the 8, and their associated e,,, f, one by one. 
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Lemma 23 Let p, PB be prime ideals in algebraic number fields k, K respec- 
tively with K > k and B\p. There exist an algebraic integer a in K and 
embeddings of 0p and a into Ky such that Ky = kp(a) and op[a] = Dg. 


Proof Let € generate 0/8 over o/p and lift € back to an integer a in 0; 
then o,[a] contains representatives of all the classes in 0/8. If it also 
contains an element divisible by only to the first power, then it contains 
representatives of every class in %™/9™+! for each m > 0, and the lemma 
follows at once. But let ¢(X) in o[X] be such that $(X) is the minimal 
monic polynomial for € over o/p, and let ¢’(X) be the formal derivative of 
¢(X). Here ¢ (X) does not vanish identically because 0/8 is separable 
over o/p; hence o! (€) 4 0 because @! has smaller degree than ¢. If Il in O 
is such that ||] then P\¢(a) and 


g(a +11) = o(a) + ¢/(a) mod $?. (35) 
By (35) at least one of a and a + TI] satisfies the condition in the second 
sentence of the proof, and they are both lifts of €. O 


Exercise Obtain the analogues of Theorems 18 and 19 for infinite places. 
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Many number-theoretic objects give rise to an ideal which identifies the bad 
primes for that object and measures how bad they are. Such an ideal is 
important primarily because it has this property, but the definition does not 
always make this evident; on occasion this fact can be a non-trivial theorem. 
Typically such an ideal is called the conductor, but for an extension K/k 
of algebraic number fields it is called the (relative) different. 

By Lemma 3, every k-linear map K — k has the form a++ Tr), (a) 
for some @ in K. It is natural to ask which are the @ for which the image 
of Ox under this map lies in o,. Let S be the set of such {; clearly S is 
an 0x-module which contains Ox. On the other hand, Trx;q(a@@) must 


lie in Z. So if aj,...,@y are a base for Ox as a Zmodule then S must 
lie in the Z-module spanned by {;,... ,8n where G, satisfies 
1 ifp=v 
= , 36 
Tr(au.) { 0 otherwise. (30) 


If we write B, = >> OpCpy with cp, in Q then we have the matrix relation 


(Tr(ay@p))((Cpv)) = I. 


It follows that each dxc,, is in Z where dx is the discriminant of K, so 
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S is contained in the fractional ideal dD Kj) since S is an Dx-module, it 
is itself a fractional ideal. Since also S D Ox, it is natural to write S as 
0-1 = 0; where 0x, is an integral ideal called the different of K/k. If 


k = Q, the 6, defined above span MQ and it follows that 


Norm %K/Q = | det(Trx/Q(apap))| = |dx|. (37) 


Now let K > L > k and let @ run through the elements of Dx, A through 
the elements of DO, and y through the elements of 0,/,. If a is in K then 


a in an <=> Trx i (@@) is in ox for all 8 
=—_ Trx/k(@BA) = Trp e(ATYK/1(a8)) is in 0, for all B, A 
<=> Trx/1 (a) is in Ok for all 8 
<=> 7TrK/1 (a8) = Trx;t(@78) is in Dy for all 6,7 
<= oy is in tL for all y = > a in LPL Ki 
and therefore 
Vek = OK/LOL/k- (38) 


Let 8 be a prime ideal of K and let p be the prime ideal of k which 
it divides. By analogy with what we have already done globally, we can 
consider the set of @ in Ks such that Trxq/%,(@@) is in op whenever a is 
in Og. As in the global case, this set is a fractional ideal in Ky; we again 
write it as 0-! and we now call 0 the local different for the extension 
Ksg/kpy. We identify 0 with a power of 8, though strictly speaking it is a 
power of Py. 


Lemma 24 The global different 0x /, 1s the product of the local differents. 
Proof Suppose that n generates K over k; then by Theorem 19 


Trxskn = >, TrKp/kpn for nin K. (39) 
p 


But any 7 in K can be written as 7’ — 7" where each of ’ and 7” generates 
K over k, and (39) holds for 7 because it holds for n' and 7. Now let p 
be any prime ideal of k. It follows from (39) that if @ in K lies in Oem des 
for each P above p and if a@ is in Ox then Trx,;,(a) is integral at p. By 
considering all such 8 which are integral at every prime of K which does 
not divide p, we conclude that []0 x. /%, divides 0x, where the product 
is taken over all P dividing p. 

Conversely, suppose that $"||0%,, and let G in K be in $~" but not in 
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B'~". For any a in Ox it follows from (39) that Trx, x, (@@) is in o,, for 
all the other terms in (39) are certainly in oy. Thus @ is in 0x,, /k,» Whence 
BVicay /kp- QO 


The proof of Theorem 20 below is complicated. The motivation behind 
it is that if we write [K : k] = n, we can find Dek by taking a base 


Q1,+-+,Qn for Ox as an o,-module and solving the equations (36) for the 
B,, where now the trace is for K/k. If we could choose the a, to be 
1,a,... ,@"—1 for some integer a with K = k(q), there would be a sensible 


formula for the solution of (36). In the first part of the proof we derive 
such a formula; instead of using brute force, we apply a trick which is 
worth knowing because it turns up in a variety of contexts. For the global 
extension K/k we cannot expect to have such a base. But we showed in 
Lemma 23 that for each prime % in K we can find an @ which has the 
corresponding local property; and using Lemma 24 this is good enough. 


Theorem 20 Let a run through those integers of K for which K = k(a) 
and let ¢(X) be the minimal monic polynomial of a over k; then 0K/~ 1s 
the highest common factor of the ¢'(a). 


Proof Write 


sa) = BoX") + ByXP-7 4+ Bat, 


so that the G, are in o,[a] and fo = 1. In the algebraic closure k 


o(X) ay “ 
BE as f <pc<n-l 
San Pav) xX or OgywKn 
where the sum is over the roots a, of ¢(X) = 0; for the difference of the 
two sides is a polynomial of degree at most n — 1 which vanishes at the n 
values X = a,. Equating coefficients of X? gives 


Bn-1-p 1 if H=pPp 
Hs = , 
TK/k (« ¢'(a) 0 otherwise; 


hence the Gn_-1—p/’(@) are linearly independent over k and Tr(a#{) is in 
o, for p = 0,1,...,2—1 if and only if @ is in the o,-module S spanned 
by the Bn-1-,/¢’(a). It follows that S D Dik however, S C (1/¢’(a)), 
so 0x/k > (¢/(@)) for each a. If 1a,...,a@"~! are a base for Ox as an 
o,-module, this argument gives 0x, = (¢’(a)) because fp = 1. 

By Lemma 23 with $ = 81, we can choose a@ so that o,[a] = Op,. The 
construction ensures that a is not in 1, and it allows us to require that a 
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is in 3, for all 4 > 1. Hence in the notation of Theorem 19, for each p > 1 
the constant term of ¢,(X) is divisible by ,, and hence by p. Since $,,(X) 
is irreducible in kp, it follows from Lemma 18 that all the coefficients in 
¢.(X) except the leading one are divisible by p; and hence ¢,,(q) is not 
divisible by %,. This implies 


¢'(a) and ¢4(q@) are divisible by the same power of 1. (40) 


But ¢;(X) is the minimal monic polynomial for a over kp, where «@ is 
regarded as an element of Ky,; so an argument like that in the previous 
paragraph shows that the local different for Ky, /kp is equal to (¢{(a)). 
The theorem now follows from (40) and Lemma 24. QO 


Theorem 21 Let p,f be prime ideals in algebraic number fields k, K re- 
spectively with K > k and let e > 0 be given by P*\lp. Then Pe 0x4; 
and P°|dx/, if and only if ple. ; 


Proof Suppose temporarily that a is in $B, in the notation of Theorem 
19. Then the constant term in ¢,(X) must be divisible by P,, and hence 
by p. As in the proof of Theorem 20 every coefficient of ¢,,(X) except the 
leading one must be divisible by p; for otherwise ¢,,(X) would be reducible, 
by Lemma 18. In particular 


ainB => p divides Try, /x, (a). (41) 


Now let 6 be any element of $1~¢; if p|l7 then 7 is in 8, for every p, 
so that (39) and (41) imply p|Trx,/,(7@) whence Trx/,(@) is in op. Thus 
seo! divides OK/k- 

Now revert to the assumption that a is in 0, and denote reduction 
mod by a tilde. Then & is a root of ¢,(X) = 0; let its minimal monic 
polynomial over o/p be ¥(X). Thus y; divides ¢; and, using Lemma 18 
again, ¢ must be a power of ,. But deg; divides fi, the degree of O/P 
over o/p, and deg ¢; = e;f1; so ¢; must be an e)-th power. If ple, the 
second coefficient of pu must therefore be 0, whence 


ain Dandple => p divides Try /x,(@); (42) 


using this instead of (41) we deduce that P°|0x/,. If instead p fe; we can 
choose @ in D so that & generates O/B over o/p and the second coefficient 
of #1, which is ~Tr(@), is non-zero. But the first of these properties implies 
that $1 = wy! , and the second property then shows that p does not divide 
Trig /kp (@); thus P° does not divide 0x¢. QO 
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Corollary In any extension K/Q at least one prime p ramifies. 


Proof By the Corollary to Theorem 10, |dx| > 1; hence 0x /q # (1) by (37). 
But note that in general there is no corresponding result for extensions K’/k; 
for more information see §17. 0 


The lemma and corollary which follow will be useful in §13. 


Lemma 25 Let Ki, Ka be Galois over k, and write K = K,K». Then K 
and KK, Ke are Galois over k, 


[K : Ki] = [Ke: Kin Kd], 
and 
Gal(K/(Ky ia) K2)) ta] Gal(Ky/(K1 ia} K2)) x Gal(Ke/(Ki N K2)). 


Proof Any element of K is a rational function of elements of Ky and K». 
Any embedding K — & restricts for i = 1,2 to an embedding K; — k 
which is by hypothesis an automorphism of K;; so any image of an element 
of K is a rational function of elements of K; and Ko, and therefore lies in 
K. Thus K is normal over k. Similarly an element of K, Ko has all its 
conjugates over k in both K, and Ko, and therefore in K,  K2; so the 
latter is normal and even Galois over k. 

Now choose @ in K» so that Ky = k(a), and let f(X) be a minimal 
monic polynomial for a over Ki K2. Clearly a generates K over Ky, so 
K is separable over K; and hence over k. Moreover f(X) is irreducible 
over Kj; for if we had f(X) = fi(X) fo(X) in Ky with the f; monic, then 
the coefficients of the f; would be in K,; by hypothesis and in K2 because 
they are combinations of conjugates of a over k. Hence they would be in 
K, Kg and f would not be irreducible. But now 


[K : K)] = deg( f) = [Ke 7 Ky N Kol. 
In view of what we have already proved, the natural map 
Gal(K/(K1 nN Ko)) > Gal(Ki /(Ki ia} K2)) x Gal(Ke/(Ki N K)) 


is an injection, for if we know the effect of ¢ : K — k on K, and Kz we 
know it on K. But we already know that both sides have the same degree; 
so it is an isomorphism. QO 
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Corollary Let K,, Ko be Galois over Q with coprime discriminants, and 
write K = K,K>. Then 


KiNK2=Q, [K:Q]=[Ki:Q]-[K2:Q], xq = 2K,/Q2k/Q 


and a base for the integers of K over Q is given by the product of bases for 
the integers of K, and of Ka over Q. 


Proof It follows from (38) that 2x,9K,/Q divides both 0%,/q and 0%,/Q; 
since these are coprime by hypothesis, 0x,4x,/q = (1). By the Corollary 
to Theorem 10, this can only happen if K; N Ky = Q; this also gives the 
relation between degrees. 

Now suppose that @ is in Kz and also in Ky Ky If € is any integer in 
Ko we showed in the proof of the lemma that the conjugates of G& over Ky 
and over Q are the same; hence Trx,/Q(8§) = Trx;x,(G€) is an integer, 
and it follows that @ is also in on /Q° Thus 0x x, divides 0x,/q, and it 
follows from (38) that 8K/Q divides 9k,/Q°k2/Q: So 9x,/Q9K2/Q =2K/Q 
will follow from the previous sentence combined with dj? dx’, = dx, where 
n, = (K; : Q]; and this in turn will follow from the assertion about integral 
bases. Let @),...,Qn, be a base for the integers of K; and (i,..., Bn, a 
base for the integers of K2; then the a,@; span K as a Q-vector space and 
are integers, so they generate a subgroup of index m in the Z-module of 
integers of K. Let o1,...,0n, be the elements of Gal(K/K2); as in the 
proof of the lemma, their restrictions to K; are the elements of Gal(K,/Q). 
Let ¥1,--. ,n, be elements of Kz such that 5° ya; is an integer. The 
equations 


719501 ++-+ + Yn17jOn, = integer (1 <j <4), 


for the 7;, have determinant ,/dx,; thus each 7;,/dx, is an integer, whence 
m divides some power of dx,. A similar statement holds for dx,; and since 
dx,,dx, are coprime, m = 1. 0 


9 Idéles and Adéles 


In discussing ‘local-to-global’ problems it is often necessary to consider 
several different v-adic fields simultaneously, where each v may be either a 
finite or an infinite place. The natural language for this is that of adéles 
and idéles; we can use this to express some of our previous results in more 
elegant terms, but its real importance is in the higher reaches of the theory. 

At first sight it might seem natural to form the product of all the k,; 
this certainly is a topological ring, and it contains all the elements which 
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could interest us. But there are two things wrong with it. It has no 
satisfactory compactness properties since the individual factors are only 
locally compact; and most of its elements have not even the most superficial 
claims to be in the image of the natural map k — [] ky, for we know that 
any @ in k is a p-adic integer for almost all p. This last remark gives us a 
strong hint about the set which we ought to be considering, and it turns 
out that there is only one topology which we can reasonably impose on it. 

An adéle is an element of the set-theoretic product []k,, subject to 
the condition that if a = []a@, is an adéle then a, is in 0, for all but 
finitely many v. We shall usually denote adéles or idéles by bold Greek 
letters. They are both examples of what in a more general context is called 
a restricted direct product. 

The adéles form a commutative ring V, under componentwise addition 
and multiplication. We give V, a topology by taking as a base for the open 
sets the [|] Uy, where each U, is open in k, and U, = 6, for all but finitely 
many uv. It is easy to verify that this does define a topology, that the ring 
operations are continuous, and that the subspace []o, is open and the 
induced topology on it is just the product topology; hence J] o, is locally 
compact and so is V,. 

The diagonal map is the map k — V, defined by a+ [J] a; it enables 
us to identify k with a subset of V,. The elements of its image are called 
the principal adéles. This map induces on k the subspace topology; and 
if we view k and V, simply as additive groups we can form V,/k and endow 
it with the quotient topology. 


Lemma 26 With the conventions above, V;,/k is compact and k has the 
discrete topology. Moreover [] |la||y =1 for any a £0 in k. 


Proof Theorem 7 shows that given any adéle [] a, there exists a in k 
such that each a, — @ is in oy. In other words, every coset of k in Vi 
meets []’ k, x []” op where the first product is over all infinite places and 
the second over all finite ones. But ([]'ky)/o% is compact, so there is a 
compact subset S of []’ k, which meets every coset of 0;,. Thus every coset 
of k in V; meets S x []” Op, and the latter is compact; hence so is V;,/k. 

To prove that k has the discrete topology, it is enough to show that there 
is a neighbourhood of 0 in V; which contains no other point of k. But the 
set defined by 


[au||y < 1 if v is an infinite place, 
a, is in 0, if v is a finite place 
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is an open set containing 0; and it contains no other element of k by the 
Product Formula. QO 


The proof that V./k is compact, like the results which lead up to it, 
does not depend on the Product Formula; so we can now give a new proof 
of the latter, which is more highbrow than the one on page 35 but does 
explain why such a formula exists. For V; is locally compact and therefore 
possesses an essentially unique Haar measure, which must be the product 
of the Haar measures on the factors. Now let a be any non-zero element 
of k; multiplication by a defines an automorphism of V,; which maps k 
onto itself, so it induces a homeomorphism of V,/k. Because V;,/k has 
finite non-zero measure, this homeomorphism is measure-preserving. But 
the action of a on k, multiplies the Haar measure on k, by |lal|y, by the 
definition of the normalized valuation; so it multiplies the measure on V; 
by [1 Halle. 

Lemma 26 depends crucially on using every place v in forming the prod- 
uct. Let V” be formed in the same way as V; but leaving out the factor 
corresponding to w; then it is conjectured (but not proved in general) that 
k is dense in V™. 

For the argument above, we did not need to normalize the Haar measure 
on Vx; but for the following corollary we do. The normalizations we need 
are those described in §A1.3. 


Corollary With these normalizations, V,/k has measure 1. 


Proof As in the proof of the lemma, Vi,/k ~ (([]'ky)/o) x [] op. The 
measure of the second factor on the right is [](Normd,)~!/? = |d,|7!/2, 
and the calculation on page 3 shows that the measure of the first factor is 
\dj,|1/?. QO 


The invertible elements of V, form a group, called the idéle group ;; 
thus an idéle is an element a = [] a, such that a, # 0 for all v anda, isa 
unit for all but finitely many v. There is a natural map J, — J, where J, is 
the group of ideals; it sends a = [] a, to (a) = [] p”* where p”* |lap, and it 
extends the natural map k* — I. (The infinite places play no part in this 
map.) But although J; is a subset of V, we must not give it the subspace 
topology, for € ++ €~’ would not be continuous in that topology. (Take 
k = Q and let a?) be the adéle with al? _ p and all other components 1; 
in the adélic topology a?) — 1 as p > 00, but (a ?))~! does not tend to a 
limit.) Instead we give J, the topology induced by regarding it as a group 
of operators on the additive group V;: that is, a base for the open sets in 
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J, is given by the [] U, where each U, is open in k* and U, = o* for all but 
finitely many v. This topology is strictly finer than the subspace topology, 
' go the inclusion J; — V, and the multiplication map J, x Vi + Vi are 
continuous; and with it J, is a locally compact topological group. As with 
adéles, there is a natural map k* + J, defined by a + []a; it is called 
the diagonal map, and enables us to identify k* with a subset of Jk. The 
elements of its image are called the principal idéles. This map induces 
on k* the subspace topology; and we can form J;,/k* and endow it with 
the quotient topology. 


Lemma 27 The group k* is a discrete subgroup of Jy. 


Proof The topology induced on k* as a subset of J; is finer than that 
induced on it as a subset of V;; and the latter is already the discrete 
topology, by Lemma 26. 0 


The analogue of the first statement in Lemma 26 is much deeper than 
that lemma, and to state it we need a further definition. The map J, — R* 
given by a = [Jay + |e] = [] llau||. is well-defined because almost all 
the factors on the right are equal to 1; and it is a continuous epimorphism. 
Its kernel Jj contains k* by the Product Formula. For Jj we no longer 
have the nuisance of having two distinct induced topologies: 


Lemma 28 J} is closed both as a subset of J, and as a subset of Ve, and 
the two induced topologies on it coincide. 


Proof To prove that Jj is closed in Vi, take any [] ay in Vi, but not in Jj; 
we shall construct a neighbourhood of it in V; which does not meet Jj. 
Write C = J] |la.|ly, where the product must either converge or diverge tu 
0, because it only contains a finite number of terms which exceed 1. Since 
C #1 by hypothesis, there are two cases to consider. 

First suppose that C > 1. For each infinite place and for those finite 
primes p for which either Normp < 2C or |lap||p 4 1 (of which there can 
only be finitely many because the product for C' converges) let S, be a small 
open neighbourhood of a, in ky. Now []’ |a,||y = C where the product is 
taken over these v; so we can choose these neighbourhoods so small that 
if By is in S, for each such v, []'|/Gu||, < 2C. For any other v we take 
Sy, = oy, so that @, in S, implies that either ||G,|], = 1 or [Gully < 1/2C. 
If the latter possibility ever happens, then J] ||Gu||, < 1. Thus [] Sy is open 
in the adélic topology, contains [] a, and does not meet J}. 

If instead C' < 1 choose a finite set of places, including all infinite places 
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and all primes with ||ap||p > 1, such that if C’ is the product of the ||ay||,, 
taken over these places then C’ < 1. For each such v let S, be a small open 
neighbourhood of a, in ky; these neighbourhoods are to be so small that if 
Gy is in Sy for each such v then $(C’ +1) > [] ||Gully where the product is 
over this finite set of v. For any other v take S, = 0,. Thus [] Sy is open 
in the adélic topology, contains [] ay and does not meet J}. 

Thus Bie is a closed subset of Vz. It is closed in J; too, because the idélic 
topology on J; is finer than the restriction of the adélic topology. 

For the last assertion in the lemma it is enough to show that any J;,- 
open subset of J} is V,-open, the converse being trivial. Now let S = [] S, 
be any basic J,-open set; we need to find a V,-open set S’ such that 
SN Ji = 8’ Jj. By writing S as a union of smaller basic open sets if 
necessary, we can assume that each S, is bounded; since for all but finitely 
many vu we have S, = 0% and therefore ||a,||, = 1 for all a, in S,, there is 
a constant C such that J] |law||u < C for all [] a, in S. Now write 


si a J % if p is finite, S, = of and Normp > 2C, 
v™ | S, otherwise. 


Since the first of these happens for all but finitely many p, S’ = []S, is 
open in Vz; and S’N J} = SM J} as in the first part of the proof. Oo 


Theorem 22 J} /k* is compact. 


Proof As before, denote by J, the multiplicative group of non-zero 
fractional ideals of k, and endow it with the discrete topology. Because 
I,,/{image of k*} is finite, there is a finite group of translates of [] 0% 
whose union meets every coset of k* in J}. By Theorem 11 there is a 
closed bounded set S in []’ k*, where the product is taken over the infinite 
places, such that any coset of k* in J} which meets []’k3 x []”’ of also 
meets S x ]” o;. But this last set is the product of compact spaces and is 
therefore compact. 0 


To prove Theorem 22 we used the finiteness of the ideal class group and 
the structure of the group of units. Conversely, from an independent proof 
of Theorem 22 we can immediately deduce these two results — which are 
the key structural theorems of the elementary theory. For such a proof, see 
Chapter II of [CF]. 

We specified the measure on J, in §A1.3, and the measure on Jj now 
follows from the exact sequence (algebraic and topological) 


0-Ji > J, + R* 0. 
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In this case we can be very explicit, though uncanonical, because the exact 
sequence splits. Let u be a fixed one of the infinite primes, and let a be 
any idéle. There is a unique @ in Jj which is the same as @ in all but 
the u-th component and for which a,/, is real and positive; and the map 
@: @+ @ is a continuous homomorphism. Given a set S! in J}, let S be 
that part of its inverse image under ¢ which satisfies 


0 < log |la|| = log [ | lau|ly <2 


and define the measure of S! to be the measure of S. It is easy to see that 
this gives a Haar measure on J, which does not depend on the choice of 
u. 


Theorem 23 [f the measure on J, is the product of the measures above 
on the individual k* then the measure of J} /k* is 2"(2n)"?hR/w where w 
is the number of roots of unity in k*. 


Proof The map J, — J, induces an epimorphism from J? to the ideal 
class group; if we denote the kernel of this epimorphism by S! then it 
is enough to find the measure of S'/k* and multiply by A. Each coset 
of k* in S! contains elements of the set S* consisting of the idéles in S! 
whose components at each finite prime p are in 05; and these elements 
are determined up to an element of o*. Denote by S? the elements of the 
finite product []’ a, such that []’ ||au||y = 1, where the products are taken 
over the infinite places. There is a natural measure on S’ induced by the 
measures chosen on R* and C*, and the measures of S!/k*, of S*/ot and 
of S’/o% are all equal since S! = S? x [] 05. Now let m,-.. ,mr:+r2—-1 be 
a base for the units modulo roots of unity; given any []’ ay in S’ we can 
define real numbers 21,... , -,4r,—1 by the equations 


log ||aullv = £1 log |lm|lu + +++ + Sry tr2—1 Log [lM 4r2-1 lv 


for each Archimedean v. Each coset of 0* in S’ has just w members in 
the region given by 0 < z; < 1. But if v is real, going from da,/||a,||. to 
ad(log ||az||v) gives a factor 2 to take account of the sign of ay; and if v is 
complex, dz A dz/|z|? = d(r?) Ad6/r? in polar coordinates, and integrating 
over 6 gives a factor 27. The measure of S’/o% is therefore 


2" (2n)"w-! i 2 / TT 400g llerlls) 


where the product is taken over all but one of the Archimedean valuations 
and the multiple integral is taken over the region given by 0 < x; < 1 
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for each 1. Going from the log ||ay||, to the x; gives a further factor | RI. 
Combining these factors, we obtain the theorem. Ey 


3 


Special fields 


Most of this chapter illustrates the general theory in Chapters 1 and 2 by 
means of applications to particular kinds of field. However, many of the 
results in §12 and §13.1 depend on analytic properties of the zeta function 
which are not stated until §14 and not proved until §15. The standard 
calculations on Q( %/1) in §13 are also used to prove a particular case of 
Fermat’s Last Theorem, due to Kummer. 


10 Quadratic fields 


The quadratic fields are just the fields k = Q(,/m) where m 4 1 is a 
square-free integer. Since 1, /m are integers which form a base for k as a 
Q-vector space and A?(1, \/m) = 4m is not divisible by any square other 
than 4, the Z-module spanned by 1, ./m either is 0 or has index 2 in o. 
Since $ and 4/m are clearly not integers, the latter case happens if and 
only if a = (1 + ./m) is an integer. The minimal equation for a over Q 
is X? — X — $(m — 1) = 0, so a is an integer if and only if m = 1mod 4. 
Thus 


d= m if m=1mod4, 
~ ) 4m if m=2 or 3mod4. 


Both cases are covered by the statement that the integers are those numbers 
of the form 4(z+yWVd) for which x, y and $(x?— dy”) are in Z. The primes 
which ramify are those which divide d, and ramification can only mean 
(p) = p? with f = 1. Any other prime either splits (that is, (p) = p’p” with 
f = 1 for each factor) or remains prime (with f = 2). To test which, we 
use Theorem 19 and Lemma 19; for X? — m factorizes in Q, if and only if 
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X* —m=0 has a root in Q,. Thus when 2 does not ramify 


(2) splits in k if m = 1mod 8, 
(2) remains prime in k if m = 5 mod 8; 


and if p is an odd prime which does not divide m, 
(p) splits in k if and only if (7) =1 


where the latter bracket is the quadratic residue symbol. 

If m > 0 then k is real and the only roots of unity which it contains are 
+1. If m <0 all units in k are roots of unity, and they correspond to the 
solutions of x? — dy? = 4 in Z. It is now easy to see that k contains six 
roots of unity if d = —3, four if d = —4 and two otherwise. 

There is a close relation between ideal classes in a quadratic field k and 
classes of binary quadratic forms 


aX? +bX\X2+cX? with b?-4ac=d (43) 


under the unimodular group. For let a be an non-zero integral ideal of k 
and let a1,@2 be a base for a as a Z-module. Any a in a has the form 
Q@ = 210, + L2Q2 with x1, 2x2 in Z, which implies 


norm @ = (11Q + X2Q@2)(X19a1 + Z20Q2) 


where ¢ is the non-trivial element of Gal(k/Q). The right hand side is 
a quadratic form in 2,22 all of whose coefficients are rational integers 
divisible by a- aa = (Norma), and its discriminant is 

2 


Be ee = (Norm a)?d,. 


oa, aad 


Dividing by Norma we get a quadratic form (43). We can change the base 
for a as a Z-module by means of an integral unimodular transformation on 
21,22. If we start from another ideal in the same class — say a’ = (f)a for 
some @ in k* — then we can take Ga,, Baz as a base for a’; this yields the 
same quadratic form as before. Conversely if we write 


aX? + bX,Xo + cX$ = a(X1 + AX2)(X1 + AX2) 


a straightforward calculation shows that a,aX are a base for an ideal of 
k and this ideal gives rise to the quadratic form which we started from. 
However, the quadratic form has two linear factors and therefore gives rise 
to two ideal classes, which are in general distinct. 

When m < 0 there is a simple way round this: we fix an embedding 
k — C and require the base a, a2 to satisfy S(a2/a,) > 0. This imposes 
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an ordering on any base for a as a Z-module, and restricts us to integral 
transformations on x1,Z2 with determinant +1. Now there is a one-one 
correspondence between ideal classes in k and equivalence classes of binary 
quadratic forms (43). This gives the easiest way of computing class numbers 
when m < 0. For after a suitable linear transformation we can assume that 
a is the least value taken by the quadratic form (43) and that, subject to 
this, c is as small as possible; in this case we say that (43) is in reduced 
form. This gives c > a and |b| < a, whence a < \/—d/3. Conversely, if 
these inequalities hold it follows easily that a and c have the properties 
in the previous sentences. We can obtain a unique representative of the 
equivalence class of quadratic forms by writing X; + X»2 for X, if b = —a, 
or X9,—X, for X1,X2q if c = a and b < 0; in this way we can require 
a>b>-a,c2>aandb>0ifc=a. If B = ag/a, these conditions are 
equivalent to 3 > 36 > —4, |B] > 1 and SG > 0 if |B] = 1. The reader 
who knows about elliptic modular functions will recognize the fundamental 
domain of the modular group. 

To compute the ideal class number, we let b run through all values with 
|b] < /—d/3, where b is odd or even according as d is; for each such b 
we factorize ac = 4(b” — d) and list the triplets (a,b,c) with a > b > —a, 
c>aand 620 if a=c. There are h of these triplets, where h is the class 
number of k. The Brauer-Siegel Theorem (76) says that logh ~ 4 log|d| as 
d — —oo, but the known effective results are much weaker. There is a law of 
composition of quadratic forms (43) which corresponds to multiplication of 
ideal classes, but for computational purposes there is little to be gained by 
using it. There is a remarkable conjecture, supported by both theoretical 
arguments and numerical evidence, that the odd order part of the ideal 
class group is cyclic for 97-7% of all values of m < 0. (See [Co], §5.10.) 


Theorem 24 Let m <0 and suppose that t distinct primes divide d. Then 
there are exactly 2'-! elements of order 1 or 2 in the ideal class group of 
k. 


Proof Because a- ca = (Norma) is principal, we need to count the number 
of ideal classes fixed by o. But if a1, a2 are a base of a with S(a2/a1) > 0 
then ca 1, —ca2 are a base of ga with $(—ca2/oa1) > 0. Suppose that the 
reduced binary quadratic form corresponding to a is aX i +6X,X_94+ cx. a 
then oa corresponds to aX? —bX,X2+cX3 and this is also reduced. But a 
and oa are in the same ideal class if and only if these forms are equivalent, 
and it is easy to see that this happens precisely for forms of the shape 


aX? 4+¢X?, aX?+aX1X2+cX? or aX? +6X\X2+0X}. 
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In each of these cases the ideals a = (a,a3) and oa = (a,~ac), where 
af? + 68 + c= 0, are not merely in the same class but equal. Hence each 
ideal class in k of order 1 or 2 contains at least one ideal which is a product 
of ramified primes and rational primes, and clearly we can ignore the latter. 
Conversely, the square of any ideal which is a product of ramified primes 
is principal. 

It remains to discover which of the 2¢ ideals generated in this way are 
themselves principal. Each of the latter corresponds to two pairs of integers 
z,y and —z, —y such that 4(x? — dy”) is square-free and divides —d. This 
implies x? ~ dy? < —4d, so that |y| < 2. Apart from the obvious solutions 
(+2, 0) and (0, +2), the latter requiring 4 Jd, we need only consider y = +1. 
Now x? — dy? = —4d/r for some r requires 1 < r < 4; since d = —2 is 
not allowed, the only solutions are given by x = 0 when 4d, z = +1 when 
d = —3, zc = 2 when d = —4, and x = +3 when d = -3. Thus for 
every value of d we obtain just two principal ideals of this kind: these are 
(1+ /—1) and (1) when m = —1, and (/m) and (1) in all other cases. 0 


When m > 0 the situation is more complicated. By Theorem 11 the 
group of units is the product of {+1} and an infinite cyclic group; any of 
the four units which generates the latter is called a fundamental unit. 
There are various ways of defining a reduced form (43), the simplest being 
to require |c| > |a| > |b|; for given d there are only finitely many reduced 
forms, and they can be listed because a? < 4d. But each equivalence class 
will contain a number of reduced forms, this number being large if the 
fundamental unit is large, which it usually is. Moreover, one wishes to 
find both the class number and the group of units. For hand calculation, 
the better way to proceed is illustrated in Examples 4 to 6 below. Here we 
take advantage of the estimates quoted on page 20, whose proof is sketched 
in the exercise at the end of this section. The analogue of Theorem 24 is 
that there are 2'~! elements of order 1 or 2 in the ideal class group if the 
fundamental unit has norm ~—1, and 2¢-? if it has norm +1, but the proof 
is considerably more complicated than that of Theorem 24. Subject to 
this, the class number is usually small; for example it is conjectured that 
the class number is not divisible by any odd prime in 75-4% of all cases. 
But even the assertion that h = 1 infinitely often is only a long-standing 
conjecture. 


Example 1 m = —14. Now d = --56 and the reduced quadratic forms (43) 
are 


X2+414X2, 2X?47X2, 3X242X,X0+5X2. 
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Thus A = 4. The ideal class group must be cyclic, by Theorem 24; and this 
example shows that we cannot strengthen the conclusion of that theorem 
to 2&lih. 


Example 2 m = —35. Now d = —35 and if we take account of equivalences 
the reduced quadratic forms are 


X?4.X,Xo+9X2, 3X24 X1X04+3X2, 


so that h = 2. The odd prime p splits if and only if (~3°) = 1; such a p is 
a product of principal ideals if p = X? + X1X2+9X@ is soluble in integers, 
and of non-principal ideals if p = 3X? + XX2 +3X? is soluble in integers. 
(Just one of these must happen.) The former requires p = +1mod5 and 
the latter requires p = +2mod5. But to have so simple a rule as this 
depends on having h = 2*~! in the notation of Theorem 24. 


Example 3 m = ~131. Now d = —131 and if we take account of equiva- 
lences the reduced quadratic forms are 


X? 4 X1X_4+33K2, 3X74 X.X2+11X}?, 5XP43X1X24+ 7X2, 


so that h = 5. Here (2) is prime, (3) = pgp3 and (5) = pip?, where we can 
name the factors so that 


¥—131 = 1 mod p3 and V—131 = 2modpy, 
V-131 = 2modps and V—131 = 3 mod pg. 


There is no principal ideal with Norm 3, 5 or 15 because the first quadratic 
form does not represent any of these numbers; so the four ideals we have 
produced through factorization must between them represent the four non- 
principal ideal classes. Hence there must be a principal ideal of Norm 45, 
and one such is (3(7 + —131)). This is not divisible by p3 or pj, so it 
must factor as p43 2p whence pj is in the class of py 3. Similarly (7) = p5p7 
where /—131 = 3modpy and /—131 = 4modp?; and consideration of 
($(3+ /—131)) whose Norm is 35 shows that p/ is in the same ideal class 
as pr. 

Example 4 m = 10. Now d = 40 and ,/40/5 < 3, so any ideal class 
contains an integral ideal of Norm at most 2 and we need only look at how 
2 factorizes. We know that (2) = p2; moreover pz is not principal because 
X? —10X?2 = +2 is insoluble in Zs and hence in Z. Thus h = 2. By 
inspection 3+ /10 is a unit, and it is the fundamental unit because it gives 
the least non-trivial solution of X? — 10X? = +1. 


Example 5 m = 229. This has been chosen because it is the first case 
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when A is not a power of 2. Now d = 229 and ,/229/8 < 6, so using the 
strong result quoted at the end of §3 any ideal class contains an integral 
ideal of Norm at most 5. Thus we need only consider the factorization of 2, 
3 and 5. Here (2) is prime, (3) = pp¥ and (5) = php¥. But $(13 + /229) 
has norm —15, so after renumbering if necessary it must be equal to p4p5; 
and pf is in the same class as pj. By inspection (15 + /229) is a unit; it 
is actually a fundamental unit, but we shall not need this fact. To prove 
that h = 3 it is enough to show that pi” is not principal — that is, that all 
integer solutions of X? + X,X_2 —57X2 = +9 have X),X2 both divisible 
by 3 and thus give elements of k in the ideal (3). This time congruence 
arguments do not help. But if the equation has a solution r,, 72, we can 
multiply (x1 + 4(1 + V229)x2) by a power of the known unit 4(15 + /229) 
to ensure that 


3 < |x + $(1 + V229)z9| < 3 x 4(15 + V229); 
this implies also 
3 x $(V229 — 15) < |z1 + $(1 — V229)z2| <3 


This process does not affect whether x1, x2 are both divisible by 3. These 
inequalities define a bounded search region, and we find that within it there 
are no pairs 1, £2 of the kind we are looking for. Of course, we can make 
this process more efficient; but that does not change the underlying idea. 


Example 6 m = 73. This is a case where the fundamental unit is large 
enough for one to need an efficient process for finding it. Now d = 73 and 
73/8 < 4, so using the strong result quoted at the end of §3 any ideal 
class contains an integral ideal of Norm at most 3. But (2) = pop) and 
(3) = pgp, so after renaming we can require that ($(7+ V73)) = php. By 
considering Norms we obtain 


(4(1 + V73)) = pips”, (2(3 + V73)) = pi, 
(2(5 + V73)) = pps, (4(7 + V73)) = pops, 
(3(9 + V73)) = pg, dare vm pe ps: 


These equations are enough to show that p5,p3,p3,p4 are all principal, so 
h =1. Moreover 
24 


has trivial divisor, so that it is a unit. It is equal to 1068 + 125/73 and 
this is actually a fundamental unit. It might appear from the factorizations 
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above that one could form smaller combinations with trivial divisor, but 
there is a pitfall to beware of. For example 


(3(5 + V73))(3(7 + V73)) (3 (9 — V73)) 
12 
has trivial divisor; but it is just a complicated way of expressing 1. 


Lemma 29 Let p be an odd prime and p a primitive p-th root of unity. 
Let S => (5) p" where the bracket is the quadratic residue symbol and the 
sum is over the residue classes prime to p; then S? = (5!) p. 


Proof We obtain S? = Y(P)p"t? = ¥, (4) ¥, 0+” on writing 
s =rt. The inner sum is p if ¢ = —1, and 0 otherwise. oO 


Corollary Any quadratic field is cyclotomic — that is, it is a subfield of 
the field of all roots of unity. 


Proof After the lemma, it is enough to prove the result for Q(/—1) and 
Q(/2), for any quadratic field Q(,/m) lies in the least field containing 
V—1, V2 and the \/£p for the odd primes p dividing m. But /—1 is a 
fourth root of unity and (1 + /—1)/¥2 is an eighth root of unity. The 
corollary is also a special case of the Kronecker-Weber Theorem, for which 
see §20. O 


Theorem 25 (Quadratic Reciprocity) If p,q are distinct odd primes, 
then 


(2) () = (-1)@-DE-DA, (44) 
Proof Since F% is cyclic, (5) = r~))/? mod p. If S$ is as in Lemma 29 then 
sila (—1)-Y@-1)/4p(a-1)/2 = (-1)@-D@-v/4 (2) med: 

On the other hand, if we write s = rq, 
St = 50 (5) 0" = (3) D2 (a) 0° = (8) S mod g; 


dividing by S, which is prime to g, and combining with the previous equa- 
tion we obtain (44). o 


To obtain the auxiliary law (?) = (—1)*-)/8, consider T = (1 + i)? 
where i = /—1. For on the one hand T = 1 + 7? mod p; on the other hand 


T = (14 4)(21)®-Y/? = (1 4 i) Y/? (2) modp. 
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Exercise Let f = aX? + bXY +cY? with d = 6? — 4ac # 0 and a,b,c 
real have min|f(zx,y)| = M > 0 where z, y run through all pairs of rational 
integers not both zero. If the minimum M is attained, show that by an 
integral unimodular change of variables, and replacing f by —f if necessary, 
one can takea= M,0<b<M. 


(i) If d < 0 show that M is attained and d > 3M?. 

(ii) If d > 0 and M is attained, then split cases by considering f(1, 1). 
Show that f(1,1) > M implies that f = M(X? + XY — Y?) and 
d=5M?. If f(1,1) < —M then f = M(X? — 2Y”) and d = 8M? 
if f(3,-2) > M, while d > 221M? if f(3,-2) < -M. 


In (ii), can the argument be pushed further? What happens if M is not 
attained? 


11 Pure cubic fields 


These are fields Q(</m), where we can take m = m,m3 with m1, m2 co- 
prime, square-free and positive. Write 


3 3 
a, = */mym3, a2 = t/m2me 


so that 1,a1,@2 span k as a Q-vector space. We have 


A?(1, a1, @2) = —27m?m3. 


Theorem 26 In the notation above, d = —27m?m?3 and 1,a1,a2 are a 
base for o unless m, = +m2mod9. In the latter case z(1 + mya, +mM2aQ2) 
is also in o and d = —3m?mi. 

Proof If for example p # 3 is a prime factor of m; then (p,a1)? = (p), 
so that p ramifies; since p*||A? we must have p?|{d. If 3/m, a similar 
calculation shows that (3) = p3 with p3|[a, and p3|la2. Hence if co, c1, ce 
are in Q the exact powers of p3 which divide the non-zero summands in 
Q = Co + C101 + Code are all different, so that a can only be an integer if 
Co, C1, C2 are 3-adic integers. It follows that 1, a1, @ are a base for o if 3|)m, 
or similarly if 3|mz. If instead 3 fm,mz then d contains an odd power of 3 
and therefore 3 ramifies. Theorem 11 allows two possibilities: 


(3) = php? and d = —3m?m3, or (3) = p3 and d = —27m?m3. 
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By Theorem 19 the first case happens if and only if X? — m = 0 is soluble 
in Q3, and by Lemma 19 this happens if and only if m = t1mod9. This 
is the same as m; = +m2mod9Q. To find a base for the integers in this 
case, let p = %/m in Q3; then (a; — »)(a? + way + pw?) = 0, so Theorem 19 
implies that maa2 + wa, + yw? = 0 in ky where p = p4. Since p = m, mod3 
it follows easily that 1+ my,a 1 + mga2 = 0mod 3. o 


Now suppose that p /3m mz, so that p is not ramified. Since F; is cyclic 
of order p — 1, the equation X3 — 1 = 0 has one root in F, if p= 2mod3 
and three if p = 1mod3. Thus if p = 2mod3 then X? — m = 0 has one 
root in F, and that root has multiplicity 1; so it follows from Theorem 19 
that (p) = p’p” where Normp’ = p and Normp” = p?. If p = 1mod3 then 
x? — m = 0 has three roots in F, or none; and the theoretical criteria for 
distinguishing between these two possibilities are more complicated than 
direct calculation. In the first case (p) = p’p"p’”; in the second case (p) is 
a prime ideal in k. 


Example m = 12. Now 1,a, = V12,a2 = V/18 are a base for o and 


norm (cg + c1a1 + c2a2) = f(co,c1,C2) = cB + 12c3 + 18c3 — 18¢9c1c2. 


Since ,/—d/23 < 7 we need only look at the factorization of 2, 3 and 
5. We have already seen that (2) = p3, (3) = p3 and (5) = pp” where 
Norm pf = 5 and Norm pf = 5”. But 


f(2,1,1) =2 so that (2+ a1 + a2) = pa, 
f(3,1,1) = so that (3+ a1 + a2) = ps, 
ae 2,2) = sothat (5+ 2a; + 2a2) = ps, 
f(0,-1,1)=6 so that (a1 — @2) = paps. 


It follows from the first three of these that all ideals of Norm less than 7 
are principal; so h = 1. Moreover 
(2 +ay+ a2)(3 + a1 + Qe) 
a2-ay 


= 55 + 24a, + 2lag 


is a unit — and actually the fundamental unit. 


12 Biquadratic fields 
Let K be the biquadratic field Q(,/a1, \/a2), where none of a1, a2 and aa 
are squares, and write a3 = a,a2/m* where m? is the largest square which 
divides a;a2. Then K is normal over Q with Galois group C2 x C2, and 
K has three intermediate fields k; = Q(,/a;). Throughout this section 
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d,h, R and w will be the values for K, and the values for k,; will be denoted 
by the subscript 7. Neglect for the moment primes which ramify in K/Q; 
then e = 1 and the splitting group Z is cyclic of order f. We have two 
possibilities: 

(i) f =1, so that p splits completely in K and in each k,; 

(ii) f = 2, so that p splits in one k; and remains prime in the other two. 


Thus up to finitely many factors of the form (1 — p~£*)+! arising from the 
ramified primes, 


(Cq(s))?CK(8) = Gea (8)ka(8)Cka (8)- (45) 


We could check in the same way that the factors from the ramified primes 
also cancel, but it is less effort to argue as follows. Let 6(s) be the quotient 
of the two sides of (45), which we know to be the product of finitely many 
factors of the form (1 — p~/*)*!. Let ¥(s) be the quotient of the products 
of expressions (77) below corresponding to the two sides of (45); then 


_vQl-s) _ a(l-s) 1-20)/2 
l= eG) Bw) |d/dydqdg|~?9)/ 
where the left hand equality follows from the functional equation and the 
right hand equality holds because the terms in curly brackets in (77) cancel 
in w. In view of the shape of 9(s), this can only happen if everything 


cancels; so #(s) = 1 and up to sign 
d = didod3. (46) 
It is easy to check that the sign is also correct. Applying (75) to (45) gives 
AR = hyhohg Ri RoR3 x (4w/wiwews). (47) 


The last factor is easy to calculate, for 7/1 can only lie in K if o(n) =1, 2 
or 4 where ¢ is Euler’s function, by the Corollary to Theorem 27 below. 
The only even values of n satisfying this are those with n < 12; and we can 
rule out n = 10 because Gal(Q( '/1)/Q) + Cy. Now 


n=8 <=} K=Q(vV-1,V2) whence 4w/wiw2w3 = 2; 


in all other cases, including n = 12 for which K = Q(./—1, /—3), it is easy 
to check that 4w/wiwow3 = 1. 

Now let be a unit in K and denote by o; the non-trivial element of 
Gal(K/k;). Then 1 - o;7 is in k; and therefore in 07, the group of units of 
k,; and on multiplying these three expressions together we see that 


1? = 4n?normxqn = + [[@ on) 
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is in 0[0303. Denote by W the group of roots of unity in K; then it follows 
at once that 


1 or 2 if K is complex, 


eas Weinaes| { 1,2,4 or 8 if K is real. 


(48) 
Here we can delete W except when K = Q(/—1, V2). 

Suppose first that AK is complex; choose the notation so that k, is real 
and let 7 be a fundamental unit of K. Then wy‘ is a fundamental unit 
for k, where w is a root of unity in K, and hence R, = eR; taking into 
account the anomalous case K = Q(/—1, V2), (47) gives 


h = phihahs[Dj : of0303}. 
If instead K is real, a similar calculation gives 
h = Fhyhohs[O% : of 0303]. 


I know of no other way to prove such results; and study of particular cases 
suggests that there is no corresponding relation between the ideal class 
groups. 


13 Cyclotomic fields 


Let km be the field of m-th roots of unity. By a cyclotomic field I 
shall mean any subfield of any k,,; but the reader is warned that some 
authors restrict the phrase to the k,, themselves. It is in any case natural 
to start with the k,, themselves; the properties of their subfields can then 
be deduced by means of Theorem 16. The most interesting fact about 
a general cyclotomic field is that one can write down explicitly units of 
a particular kind (the so-called cyclotomic units) and that usually these 
generate a subgroup of finite index in its full group of units. Moreover, this 
index is the product of the ideal class number fA and factors which are easy 
to compute. 

In studying the k it is convenient to deal with the prime factors of m 
one at a time — that is, to deal first with the special case when m is a prime 
power and then, writing m = |] p;', to obtain the field k,, by composition 
from the various fields kyr using Lemma 25 and its Corollary. 


Theorem 27 Let p be a primitive p’-th root of unity, where p is prime; 
then N = [Q(p) : Q] = p"!(p—1) and the conjugates of p over Q are the 
distinct p" with n prime to p. The numbers 1, p,... ,p%~1 form a base for 
the integers of Q(p). Write M = p’~1(pr —r — 1); then the different of 
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Q(p) is (1 — p)™” and its discriminant is +p” where the sign is negative 
if p = 3mod4 or p” = 4 and positive otherwise. The roots of unity in 
Q(p) are just the tp". The only prime which ramifies is (p) = ((1—p)). 
There is a natural isomorphism between Gal(Q(p)/Q) and the multiplica- 
tive group of residue classes modp" prime to p, where the automorphism 
corresponding tonmodp" is on: p+ p”. If £ is a prime other than p, the 


Artin symbol is (N92) = 08. 
Proof Certainly p satisfies the equation 


XP 1 
W(X) = so5T T= 


of degree N, whose roots are precisely the p” with n prime to p. In partic- 
ular 


0 


[[@ - 6”) =¥@) =p. (49) 


But 1 ~ p divides 1 ~ p”; and if n’ is such that nn’ = 1 mod p” then 1 — p” 
divides 1 — po = 1-—-. Hence each factor in the product (49) is 1 ~ p 
times a unit, and in terms of ideals we have (p) = ((1~ p))‘. This implies 
that [Q(p) : Q] > N; and since consideration of p gives the opposite 
inequality it follows that [Q(p) : Q] = N, that y is irreducible over Q, that 
all the p” with n prime to p are conjugate to p, and that 1 — p is a prime 
ideal. Since an element of the Galois group is determined by its action on 
p, the Galois group consists of the o, : p++ p” with n prime to p and 
Tmnp = P™” = OmOnp. 

Except in the special case p” = 2 all embeddings of Q(p) in C are 
complex, and therefore r; = 0 and rg = 5N ; moreover we know that the 
sign of the discriminant is (—1)"?. Since 


r r-1 
[](o - o0) = vo) = 21)" ~1) 
where the product is taken over all the op other than p itself, and 
r~ r-1 r-1 
norma p)/q(pP?>_ 1) = +(norma, v/q( V1 —1))P =+p" 
by (49) in the special case r = 1, we find that 
A*(1,p,... ,p%~!) = tnorm y'(p) = +p™, 


which is the value asserted for the discriminant up to sign. In particular, 
the only factor we can take out of this is a power of p. To assert that the 
p” for 0 <n < N form a base for the integers is the same as to assert that 
the (1 ~ p)” form one. Thus to prove the assertions about the discriminant 
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and a base for the integers, it is enough to show that if the a, are in Q 
for 0 < n < N and are such that }°a,(1 — p)” is an integer then the 
a, are integers. But all the non-zero terms in this last sum have distinct 
(1 — p)-adic additive valuations, so each of them must be an integer. Since 
there is only one prime ideal in Q(p) which divides (p), the different is 
uniquely determined by the fact that its Norm is the ideal generated by 
the discriminant. Now suppose that there is a root of unity ¢ in Q(p) which 
is not of the form +p*; raising it to a power if necessary we can assume 
it is an £°-th root of unity for some prime ¢. Here dq.)|dqip) because 
Q(e) C Q(p); and since we have already shown that dq.) is divisible by £ 
this implies 2 = p. Comparing discriminants now gives s < r, contrary to 
hypothesis. 

Finally, the Artin symbol satisfies (AG/2 )¢ = ££ mod £ for every integer 
€ in Q(p). Taking € = p, it is evident that og is the only element of the 
Galois group which meets this condition. Oo 


Corollary Let p be a primitive m-th root of unity for some m > 2; 
then (Q(p) : Q] = ¢(m) where @ is Euler’s function. If n is prime to 
m then p” is conjugate to p over Q. The roots of unity in Q(p) are just 
the tp". The primes which ramify in Q(p) are just those which divide 
m, except that 2 does not ramify if 2||m. Gal(Q(p)/Q) is isomorphic to 
the multiplicative group of residue classes modm prime to m, where the 
element corresponding to nmodm is on : p++ p™. If £ is a prime not 
dividing m the Artin symbol is given by ( aye) = 08. 


Proof Recall that if m = []p;‘ then the value of Euler’s function is 
-1 
o(m) =m] (+ ) . 
plm 


Now everything follows at once from the theorem, together with Lemma 25 
and its Corollary. These also give us the value of the different and hence of 
the discriminant. The assertion about the roots of unity is proved in the 
same way as in the theorem. Oo 


For any prime p, let m = p™m’ where m’ is prime to p. If Am denotes 
the group of residue classes modm prime to m, there is a canonical iso- 
morphism 


Gal(km/Q) Am * Am! X Apr © Gal(km:/Q) x Gal(kpr /Q) 


and the primes above p in k, are totally ramified in km/km:. It follows 
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that the splitting group of p is C x Apr, where C C Am: is the cyclic 
group generated by p, and the inertia group of p is Ap-. The corresponding 
statements for any cyclotomic field now follow from Theorem 16. 


The following result will be needed both in §13.2 and in §20. 


Lemma 30 Let k = Q(¢) where ¢ is a primitive p-th root of unity, and 
write m =1—¢. If € is in og and prime to m, and if £ = a} modx? for 
some Q in of) then (m) is unramified in K/k where K = k( 4/6). 


Proof Suppose first that £ = a} mod 7?t!. In the notation of Lemma 20 
we have m = p — 1, so that we can take r = 2; hence £ = a? for some 
@ in 0/,), 80 that (a) splits completely in K by Theorem 19. If instead 
nm? ||(€ —a), we can assume that K # k, which indeed follows from the last 
assumption. Let n = (#/€ — ao)/7, so that K = k(n), and let f(X) be the 
minimal monic polynomial for 7; then 


f(X) =X? + (aB""p/m?-1)X + (ah — €)/n? mod r. 


It follows that 7 is in Ox; hence 04/4 is prime to 7 by Theorem 20, and 
so (7) is unramified in K/k by Theorem 21. It is not hard to show that in 
this case (7) remains prime in K. oO 


13.1 Class numbers of cyclotomic fields 


Let k be any cyclotomic field with [k : Q] = n and let m be such that 
k C km; it is proved in §20 that all fields abelian over Q can be obtained 
in this way. Let A,, be the group of residue classes mod m prime to m 
and H,, the kernel of the map A,, + Gal(km/Q) — Gal(k/Q) given 
by the Corollary to Theorem 27; we shall meet almost the same notation 
again in §17. Let x be any character of Aj;,/Hm and let f, be the exact 
conductor of x — that is, the least f such that x is defined modf; and 
write f, = Normf,. It will follow from Theorem 33 that |d«| = [] f,; 
alternatively, we can apply to both sides of (50) the functional equations 
implicitly described in (79) and (77) below. The key to what follows is the 
identity 


Ce(s) = [] L(s,x)s (50) 


which follows from the remarks after the proof of the Corollary to Theorem 
27. If xo is the principal character, Z(s,xo) is just the Riemann zeta 
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function (71). Multiplying both sides of (50) by s — 1, letting s + 1 and 
using (75) we obtain 
hae { atl d,.|}/2T]'L(1, x) if k is real, - 
(2n)-"/2wl\d,|!/27]'L(1,x) if kis not real, 1) 
where in each case the product is over all characters of Am/Hm except 
Xo. To justify this, we must show that the L(1, x) are well-defined; and to 
make it useful we must obtain a closed formula for the L(1,x), which we 
now do. Let x run through a set of representatives of the residue classes 
mod f, and let y run through those prime to f,. If ¢ is a primitive f,-th 
root of unity then 


L(s, x) = Oh 


because the innermost sum is f, if n = ymod/f, and 0 otherwise. By 
rearranging we obtain 


L(8,x) = fz"), {dx} (So ete}. 5) 


If c = Omod f, the first expression in curly brackets vanishes. But if 
xz #0mod f, then at least formally 


tim Yo n9G-"* = YC /n) = log 6-7) 53) 


where we choose that branch of the logarithm which has imaginary part 
strictly between —7i and mi. This last step can be justified by a standard 
theorem of Abel or by the exercise which follows. 

Exercise If |z| < 1, show by uniform convergence that 


lim) n-°z" = yoo = —log(1 — z). 


sl 


If also |z;| < 1 show that 


$(s, 21) = done = yim —(m+1)~°}(z1 +++ +2") 


and that, provided |z; — 1| > € for some fixed ¢ > 0, the right hand side is 
absolutely convergent uniformly in z; and s provided Rs > e. Deduce that 
lims1 $(s, 21) exists and is continuous in z,, and derive (53). oO 


The Gauss sum, which is the first expression in curly brackets in (52), 
satisfies 


ta(x) = >> x(y)67¥ = X(2)r1(x)- (54) 


For if x is not prime to f let r be the quotient of f by the highest common 
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factor of x and f. Replace y by yc where c = 1 modr; since cr = xmod f 
we obtain Tz(x) = x(c)Tz(x). Thus tz(x) = 0 because if x(c) = 1 for all 
such c, the conductor of x would divide r. If on the other hand x is prime 
to f then T2(x)x(z) = D1, x(zy)¢7¥ = 71(x). Moreover 


(x)ra(x) —* 2 x(z)x(y)¢7-¥ = =e Lx x(z)gre— 
on writing x = yz. Summing first over y gives |71(x)|? = 
It follows from (53) and (54) that 


x) = fg! SS te(x) log(1 — ¢-*) = - fe *r1(x) 5 X(2) log(1 - 677). 


Since [] fy = |dg|, the contribution of the fy 4171(x) to J]’|Z(1,x)| will be 
|d,,|~1/2 which will cancel with the {d,|!/? in (51). 

It is now necessary to split cases. We shall say that x is an even char- 
acter if x(—1) = 1 and an odd character if x(—1) = —1. Since complex 
conjugacy on k is induced by p++ p~! where p is a primitive m-th root of 
unity, it follows from the Corollary to Theorem 27 that all characters are 
even if k is real but half of them are odd if k is not real. If k is not real, it 
has a totally real subfield kp such that [k : ko] = 2; this is the field fixed by 
complex conjugacy, so the characters associated with ko are just the even 
characters associated with k. It follows from (51) that if k is real 


2” tar =[T [So x(z)log(t - ¢-7)], (55) 
whereas if k is not real 
h = 2ho{2"/?-?wRo/R}(2n)-"/" TT |S) x(z)log(—¢-*)| (66) 


where ho and Ro are the values for kp, the product is taken over all odd 
characters and the expression in curly brackets is equal to [of : of,]. For 
further information on this last expression, see Exercises 1.8 and 3.9. 

Suppose first that k is real. Let f be a factor of m; for what follows to be 
non-trivial, f must be a multiple of f, for some non-principal x. Let Hy 
be the image of H,, in Ay and ¢ a primitive f-th root of unity. Fix some a 
prime to f and not in Hy, and let c run through a half-set of representatives 
of the classes mod f in Hy, so that the cmod f and the —cmod f together 
represent all the classes in Hy. Then 


TD (600-7 - 29/1 = 6)} (57) 


is a unit in Q(¢) and is fixed by Gal(Q(¢)/kN Q(¢)) = Hy; so it is a unit 
in k, and it is not obviously trivial. The units of k composed from —1 and 
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these units are called the cyclotomic units of k. In general, it follows 
from Theorem 11 that there are multiplicative relations among the units 
(57) as a and f vary; but no one has sorted out the details. One should 
expect that the right hand side of (55) is the regulator of a suitable set 
of cyclotomic units; but this again is unproved. Both these are problems 
which defeated Hasse, so they will not be easy. All that one knows is the 
following. 

Let x be a character mod f, but not necessarily with conductor f, let ¢ 
be an f-th root of unity, and write 


= }> x(z) log(1 — ¢-*) (58) 


where the sum is taken over a set of representatives x of the residue classes 
mod f prime to f. For any prime p we can obtain a character y* mod fp 
by restricting the argument of x to be prime to p. Let £? = ¢; then 


(eS GS ae) Rae rey: | (59) 
If p|f then substituting (59) into (58) gives S(x,¢) = S(x*,€). If howevek 
pXf then we can write x = py in (58) and obtain 
S(x,6) = D> x(py) log(d — €-"¥) + S(x", 6) = x(p) 8 (x, 6) + 5(x",€). 
Thus in either case we have 
5(x",€) = (1 — x(p))5(x, 6): 


We can now rewrite (55) as 


aR {T] [0 -x@)} =[] [sox @tostt = 657)]. (60) 


Here ¢,, is a primitive m-th root of unity, the double product on the left is 
taken over all primitive characters x induced by characters x* of Am/Hm 
and over all distinct primes p|m, and the sum on the right is taken over a 
complete set of representatives x for the elements of A,,/Hm. Of course, 
this formula is only useful if no x(p) is equal to 1. 

The right hand side of (60) can be expressed as a regulator by means of 
the second part of the following lemma, due to Dirichlet. 


Lemma 31 With the notation above, let the B(y) be variables indexed by 
the elements of A/H. Then 


T1{S. x1) Bs) } = det(B(y.97") (61) 
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where the x? run through the characters of A/H and y;,y; run through the 
elements of A/H; and 


IT {30 xi@s) BGs) } = det(B (wuz) - Bey) (62) 


where the product on the left excludes the principal character and y;,y; on 
the right run through the elements of A/H other than H itself. 


Proof If we multiply the i-th row of the determinant in (61) by x*(y:) 
and add all the other rows to the first one, the term in the j-th column 
becomes x*(y;) >> x*(y)B(y); so the right hand side of (61), considered as 
a polynomial in the B(y), is divisible by > x*(y) B(y). Since this holds for 
each x*, the right hand side of (61) is divisible by the left hand side. By 
considering degrees, the quotient must be a constant; and by looking at 
the coefficient of (B(H))” that constant must be 1. 

We can renumber the y; so that y; = H. By the argument in the previous 
paragraph, the left hand side of (62) is equal to the determinant obtained 
from that in (61) by replacing the first row by (1,1,... ,1). Now subtract 
the first column from each of the others. Oo 


In each sum on the right of (60) we can write x = cy, where y runs 
through representatives of the elements of A/H and c runs through repre- 
sentatives of the congruence classes mod f in H. Thus 


DX (2) log = Ga") = DT x (wD loa = Gm). (63) 


Let oF ' be the element of Gal(k/Q) corresponding to the class of y; in 
Am/Hm. Write 


i II. {a ~ 09°) yf 05§°/€¢/(1 — «} = V/T1.{(1 - 03€°)/( — €°)} 


where []’ is taken over a half-set of representatives of the classes modm 
in H,, and |] is taken over a full set of representatives. Since ,/o,&/€ is 
an integral power of €, the n; are cyclotomic units in k, and m = 1 since 
a, is the identity. It now follows from (62) and (63) that the right hand 
side of (60) is the regulator of 7,... , Mn. 

We now turn to (56). Here every x is odd, so that 


S> x(x) log(1 — ¢77) = $ 5 x(x) {log(1 — ¢-*) — log(1 - ¢*)} 
= —nifS~ 2x2) 


where x runs through all the integers prime to f with |z| < }f. Taken 
with (56), this gives a formula for h/hp in which all the terms are rational 
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and easy to compute. That h/ho is actually integral is an easy consequence 
of class field theory; see Exercise 5.3. There are elementary proofs of this 
last statement, which are straightforward when k = k,, but complicated in 
general. 


13.2 Fermat’s Last Theorem 


A major motivation for studying k, was the possibility of applications to 
Fermat’s Last Theorem — the assertion that 


X"4Y"=2", XYZ#0 


has no solution in rational integers when n > 2. Since any integer greater 
than 2 is divisible either by 4 or by an odd prime, it is enough to consider 
the cases when n is either 4 or an odd prime p. The cases n = 3 and 
n = 4 were proved by Fermat; for two versions of the argument when 
n = 4, see Exercises 3.5 and 3.6. Although there is no way of knowing, it 
seems likely that the argument which convinced Fermat in the general case 
was of the same type as that given below, combined with the assumption 
of unique factorization. The oldest recorded version of this argument is 
due to Kummer in the mid-nineteenth century. He was able to make it 
rigorous, but the price was the additional condition that p should be a 
so-called ‘regular’ prime — that is to say, that the class number of Q(¢/1) 
is prime to p. Though most small primes (including all p < 37) are regular 
it is still not even known whether there are infinitely many regular primes. 

It is enough to assume that there are integers x, y, z with highest common 
factor 1 such that 


zP+yP=2?, ryz#0 (64) 


and derive a contradiction. Denote by ¢ a primitive p-th root of unity and 
write = 1—(¢. We separate cases according as p xyz or p|zyz. 

In the first case we can take p > 5, for if p = 3 each of z?, y?, z? would 
be congruent to +1 mod9Q and this contradicts the first equation (64). If 
p 25 we write (64) in the form 


(z+ y)(ot+Cy)--- (2+ OP 4y) = 2; (65) 


the factors on the left are coprime in pairs because they can have no com- 
mon factor other than (7), and by hypothesis (7) does not divide the right 
hand side. Hence each of these factors, viewed as an ideal, must be a p-th 
power. But since A is prime to p, an ideal whose p-th power is principal 
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must itself be principal. Hence each factor on the left in (65), regarded as 
a number, must be the product of a p-th power and a unit. In particular 


z+ Cy=aPu (66) 


for some integer a prime to 7 and some unit u. If we use a bar to denote 
complex conjugacy, then u/t is a unit all of whose conjugates over Q 
have absolute value 1; thus it is a root of unity and must have the form 
7 = +¢™ for some m with 0 < m < p. Alsoa = &mod 7 by considering the 
expression for a as an element of Z[p], and therefore (a/&)? = 1 mod pz. 
Comparing (66) with its complex conjugate, we obtain 


z+ ¢cy—n(z+¢-ly) =Omodpr. 


Multiplying this congruence by ¢ if m = 0, or by ¢? if m = p—1, we see 
that if m # 1 there is a polynomial f(T) in Z[T] of degree at most p—2, not 
divisible by p but such that f(¢) = 0mod pz. But now g(U) = f(1—U) has 
degree at most p — 2 and is not divisible by p, but g(7) = Omod pz; and 
this is impossible because the terms in g(z) all have distinct valuations. 
Thus m = 1 and n = +¢. But now y = tz modp. If the lower sign held, 
then x + y would be divisible by p, whence p|z contrary to hypothesis; so 
xz =ymodp. 

Applying the same argument to (—z)? + z? = y? gives —r = zmodp. 
Now substituting back into (64) gives p = 3, and this has already been 
ruled out. This completes the discussion of the first case. 


Nearly all the difficulty in the second case is in the proof of the follow- 
ing fundamental lemma. This was another of the precursors of class field 
theory. An elementary proof, which goes back to Kummer, is sketched 
in the exercise later in this subsection; in the text we derive the result 
painlessly from classical class field theory. 


Lemma 32 In the notation above, let « be a unit of k = Q(¢) such that 
€=al moda? for some a, in ox. Then either « = €? for some unit € ink 
or p\h. 


Proof Suppose that the first alternative does not happen. The p roots of 
X? —e form complete sets of conjugates, all sets being of equal size; so they 
are all conjugate and K = k(#/e) is abelian of degree p over k. But 0x/% 
divides p, by Theorem 20 applied to a = ~/e. Lemma 30 shows that (7) is 
unramified in K/k; so K/k is not ramified at any place. In the terminology 
of §17 this means that H contains the group of principal ideals, so the ideal 
class group has a quotient group isomorphic to Gal(K/k), whence pl|h. O 
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For the second case of Fermat’s Last Theorem, again under the condition 
that ph, it is convenient to consider a more general equation. (We shall 
use the method of infinite descent, invented by Fermat himself, and for this 
it is essential to choose an equation which will descend to itself.) Suppose 
that there are non-zero integers x,y,z in k, with m dividing z but not x or 
y, which give a solution of 


XP 4+Y? = Z? (67) 
for some unit ¢ in k. Define n by x”||z, and write (67) in the form 


(x +y)(c+ Cy): (2+ CP ty) = e2?. (68) 


At least one of the factors on the left in (68) must be divisible by 7; so all of 
them are, and the highest common factor of any two is 7a where a = (x, y) 
is prime to (7). Hence the factors on the left use up the p residue classes 
mod 7 divisible by 7, so that one of them is divisible by x”; multiplying y 
by a power of ¢, we can take this factor to be z+ y. Thus a||(z + CTy) if 
pr, and 1?("™~Y)411|(z + y); in particular n > 1. 

The product of the ideals associated with the factors on the left of (68) 
is a p-th power, and the highest common factor of any two of them is 7a; 
so as ideals 


(c+¢"y) =mab? forr =0,1,...,p—1. (69) 


Let ¢ be an integral ideal in the class of 65 ', divisible by a but not by (7). 
Thus a~‘c? is principal and hence equal to (@) for some f in o%, so that 
(69) becomes 


(Bx + C"By) = m(cb,)?. 


Since h is prime to p and (cb,)? is a principal ideal, so is ¢b,; if it is equal 
to (a,) then 


Bx + CO" By = me,aP 


for some units e,. Here m”~"||a9 and the other a, are prime to 7. Com- 
bining the equations with r = 0,1 and p — 1 gives 


eras + Ce-10, = (14 C)eoas. 


This implies that Ce_1¢7! = (—a1/a-1)? mod 7, so Ce_1€7' = n? for some 
unit 7 by Lemma 32. Thus 


af + (nay)? = (1+ C)eoez ‘ab, 


and here (1 + C)eg€]! is a unit. So we have generated another solution of 
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(67), and indeed one with x”—"|lao. If we assume that 2, y,z were chosen 
to make n as smail as possible, we obtain a contradiction. This completes 
the discussion of the second case. 

By a refinement of this argument, it can be shown that the second case is 
impossible under the weaker hypothesis p /ho, where ho is the class number 
of Q(¢ + ¢~), the maximal real subfield of Q(¢). (That this hypothesis 
is weaker than p/h is a special case of Exercise 5.3.) Vandiver conjectured 
that p never divides ho, and this has been verified by computer for all 
p< 4x 10°. But naive probabilistic arguments suggest that counterexam- 
ples to Vandiver’s conjecture should be so rare that no feasible amount of 
computation is likely to produce one. 


Exercise The object of this exercise is to provide an elementary proof of 
Lemma 32. The first half consists of the following key result. It can be seen 
as a statement that a certain cohomology group is non-trivial — though 
this interpretation does not make the proof easier. We continue to denote 
the field of p-th roots of unity by k. 


Lemma 33 Let K be cyclic of degree p over k and let o be a generator of 
Gal(K/k); then there is a unit n of K such that normg/,n = 1 but n does 
not have the form e/ce for any unit « of K. 


Note that if we did not require € to be a unit, Lemma 4 would allow us to 
satisfy 7 = €/oe. Write 


U; =OK/ox, U =U;/{torsion part of U;}. 
It follows from Theorem 11 that 
e U is a free abelian group on 3(p — 1)? generators. 


Our terminology will reflect the fact that U is multiplicative. In particular, 
if f(X) = laLX” is in Z[X] we write af = [](c’a)™ for any a in K. 
Let F(X) =14+---+X?7! and recall that F(X) is irreducible in Z[X] by 
Theorem 27; then 67(7) = 1 for every 6 in U. Denote by S(6),... ,4,) for 
any 0,,...,9, in U the set of o”@, forv =0,...,p—-2 and p=1,...,r. 
Write m = 3(p— 1). 


e We can find ™,,... ,4m in U such that the elements of S(6,,... ,@m) are 
multiplicatively independent. 


Suppose that we have chosen 6,,... ,9, where 0 <r < m, and let 6,4, in 
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U be multiplicatively independent of S(@,,... ,6,). Suppose there were a 
relation of the form 


T+1 
| (70) 


where the f,(X) are polynomials of degree at most p — 2 in Z[X] and 
fr+1 #0. Then we could find g9(X), h(X) in Z[X] and N #0 in Z with 
9fr+1 + hF = N. Raising (70) to the g(c)-th power gives 


N v 
a A =1, 


which contradicts the hypothesis on 0,,,. It follows from the last two 
results that S(6,,... ,9m) spans a subgroup of finite index in U. 


e If 0,,...,4m are chosen so that the index of S(6,,...,4mn) in U is as 
small as possible then no 6, has the form 6!~? with @ in U.. 


Suppose otherwise. It is enough to show that @ is not in the subgroup 
generated by S(61,...,9m), because then replacing 9, by @ will replace 
this subgroup by a strictly larger one. Let 


H(X) =(p—1) + (p—2)X +--+ XP? 


so that (1 — X)f(X) = p— F(X) and so 6? = 6f'). If 6 were in the 
subgroup generated by S(91,... ,9m) we would have a contradiction. Now 
note that 62 = (o£°%))1-¢ and deduce that if a is prime to p then OAa-?%, 
Applying an integral unimodular transformation, deduce: 


e if a,,...,@m are integers not all divisible by p then [] Rg does not have 
the form 61~? with @ in U. 


Now let ¢ be a primitive p-th root of unity. In proving the lemma we can 
assume ¢ = €!~? for some unit € in K, for otherwise we can take n = ¢. 
Hence €? = normy,€ is a unit in k, and € is not in k. Now lift each 6, to 
a unit 7, in K and consider the map 


@: {group generated by the n,} — o¢/{p—th roots of unity} 
induced by normy/,. There are three possibilities: 


(i) € is a primitive p-th root of unity. Let 7! be a primitive element in 
the kernel of ¢, and choose 7 to be a product of (7")* with a prime 
to p, a power of € and +1 such that normx/,7 = 1. 

(ii) No non-trivial power of €? is in the image of ¢. The kernel of ¢ has 
rank at least 2, so that there is a primitive element 7! in the kernel 
of ¢ such that norm, 47! = +1. Take 7 = +7}. 
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(iii) Neither (i) nor (ii) holds. Now there is an element 7 in the group 
generated by the n, such that (normx Tue) /€?® is a root of unity 
for some b > 0. Show that we can assume that 7 is not a p-th 
power and proceed as in (ii), using 7° /€° and the kernel of ¢. 


This completes the proof of Lemma 33. For the proof of Lemma 32 let 
7 be as in Lemma 33. We can assume that € is not a p-th power in k; if we 
write K = k(%/e) then K/k is cyclic of degree p. It follows from Lemma 
4 that there is an integer @ in K such that 7 = 6'~°. As an ideal, () is 
invariant under Gal(K/k). By considering the prime factorization of (f) 
and using the fact that there is no ramification in K/k by Lemma 30, show 
that (@) = conormx/,6 for some ideal b in k. For 6 to be principal would 
contradict Lemma 33; on the other hand 6? = normx;,;({) is principal. O 
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Analytic methods 


14 Zeta functions and [-series 


A fundamental tool in the study of prime numbers is the Riemann zeta 
function 


¢(s) = Son-* = [J -p-*)}, (71) 


where the sum is over all integers n > 0 and the product is over all primes 

p; that the two are equal is equivalent to unique factorization. The sum 

and product are both absolutely convergent in Rs > 1, and ¢(s) ~ (s—1)7} 

as s > 1 from the right. The first non-trivial property of ¢(s) is that it can 

be analytically continued to the entire s-plane, subject to a simple pole at 
= 1, and that it satisfies the functional equation 


¢(1 — 8) = 2° ~8I'(s)¢(s) cos $78; 
this equation is equivalent to saying that 
2Z(s) =0(}s)m~*/2¢(s) (72) 


is unchanged by writing 1—s for s. As we shall see below, the extra 
factor in (72) can be regarded as the missing factor in the product (71) 
corresponding to the infinite prime. The other fundamental property of 
¢(s), assuming it is true, is the Riemann hypothesis: that the only zeros of 
Z(s) lie on the line Rs = 3. 

The problem of the distribution of primes in arithmetic progressions led 
to the study of the Dirichlet L-series 


L(s,x) = }>x(n)n7* = [JQ — x(p)p-*) (73) 


where x(n) is a congruence character mod f for some integer f — that 
is to say, x(n) = 0 when 7 is not prime to f, and if n is prime to f then x is 
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induced by a character on the group of residue classes prime to f. L(s,x) 
can also be analytically continued over the whole s-plane, though without 
a pole at s = 1 if x is non-trivial, and it satisfies a functional equation of 
the form 


L(1 —s,x) = f(s, x)L(s,X) 


where f(s, x) is a known and comparatively simple function. This equation 
cannot easily be put into a symmetric form; in fact f(s, x) involves a Gauss 
sum and any symmetry would induce an identity between Gauss sums. 
There is also a Riemann hypothesis for L(s, x). 

Quite generally, to call a function a (global) zeta function or L-series is 
to assert that it is a Dirichlet series )*a,n~°, and that after normalizing 
if necessary by writing s — sq for s it has four key properties: 


(i) It can be written as an Euler product [] ¢p(p~*) where each $(X) 
is a rational function of X; note that this makes no allowance for 
factors corresponding to the infinite prime. 

(ii) The series and product converge absolutely in Rs > 1 but no further; 
and they can be analytically continued to the entire s-plane subject 
to possible poles at s = 0 and/or 1. 

(iii) There is a functional equation connecting L(s) with L(1 — 3), where 
the latter expression is often obviously the same as L(1 — s). 

(iv) The non-trivial zeros of L(s) lie on Rs = 3}. 


The function may also have zeros at z = —n for certain n > 0; these should 
be thought of as the consequence of failing to include a factor corresponding 
to the infinite prime in the product required by (i). Fortunately, these four 
properties only need to be asserted as conjectures and not as proven facts; 
indeed modular forms give rise to examples where even (i) is not trivial, 
and there is as yet no case where (iv) has been proved. On the other hand, I 
know of no case in which the second part of (ii) has been proved without the 
argument simultaneously proving (iii). A function having these properties 
is usually called a zeta function if it has a pole at s = 1 and an L-series 
otherwise; but this is not a firm rule. 


For an arbitrary algebraic number field k, the only possible analogue of 
the Riemann zeta function which satisfies (i) above is 


Ce(s) = }\(Norma)~* = [](1 — (Normp)~*)~', (74) 


where the sum is over all non-zero integral ideals a and the product is over 
all prime ideals p. Again, this identity is equivalent to unique factorization. 
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To any rational prime p there correspond at most [k : Q] primes p; so the 
product is absolutely convergent in Rs > 1 and hence so is the sum. 

It is reasonably easy to study the behaviour of ¢,(s) as s — 1 from the 
right. One rewrites the Dirichlet series as 


foo} 
Ck(s) = So ann~* a aa z*~A(x)dz 
; 0 
where A(z) = 7 ,.¢,4n; so it is enough to estimate A(x). The calculation 
is essentially that in the proof of Theorem 23, and the conclusion is 


(s —1)Ce(s) + 2" (2m)"|d|-/2hRw-? as s 1. (75) 


We shall not carry out this process explicitly because the result is contained 
in Theorem 28 in the next section. The formula (75) is among other things 
the starting point of the proof of the Brauer-Siegel Theorem: that if k is 
Galois over Q then 


log(hR) ~ 5 log|d| as |d| + co (76) 


for fields of any fixed degree; see [L], Chapter XVI. Empirical evidence 
suggests that if r; + r2 > 1 (so that FR is non-trivial) then log h is usually 
very small compared to log |d|, though this is not so for the families of fields 
which one is most likely to write down; but it is very unlikely that there is 
anything analogous to (76) for A or R separately. 

Analytic continuation and the functional equation for ¢,(s) are much 
more difficult, and were only proved by Hecke in 1918 in a remarkable 
paper. With the advantages of hindsight, the basic idea of Hecke’s paper 
is fairly natural; but the details are complicated and the proof depends on 
what appears at first sight to be a lucky accident. Hecke’s first result was 
that 


Cu(s){m~ 9/90 (5 8) }72 { (2m)! 91 (8) }* | di 9/7 (77) 


is unchanged on writing 1— s for s. It is now natural to guess that 
n—$/20(3s) and (2m)'~*I'(s) are the missing factors in the product formula 
for ¢,(s) corresponding to a real and a complex infinite place respectively, 
even though they look totally unlike the factors in (74) coming from the 
finite primes. We have already seen that the discriminant d = tNorm?) is 
conductor-like, so that the factor |d,|°/* = (Normd)*/? should be thought 
of as related to the conductor of the field k. 

For L-series, the situation is more complicated. Choose a finite set S of 
prime ideals and denote by J* the group of those fractional ideals whose 
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prime factorizations do not involve any ideal in S. Let x be a character of 
I and write 


L(s, x) = > x(a)(Norm a)~* = | [(1 — x(p)(Normp)~*)? (78) 


where the sum is restricted to integral ideals in J° and the product to 
prime ideals not in S. To prove analytic continuation and a functional 
equation, we need to impose certain conditions on x which constitute the 
definition of a Hecke Gréssencharakter. However, the appropriate factors 
for the infinite places are no longer those that appear in (72) and (77); 
we have to replace s by s + sy, where s, depends both on x and on the 
particular place we are looking at. Replacing x by y~! reverses the signs 
of all the s,. Write 


T(s,x) =] (x @t)r2(s + sv))}]] {(2n)2- TF (s + 50)} 


where the first product is taken over the real and the second over the 
complex places. Hecke’s second result was that 


A(s, x) = L(s, x)P(s, x){Norm(fo)}*/? (79) 


is multiplied by a number of absolute value 1, depending only on x, when 
s is replaced by 1 — s and y by x~!. He expressed this last number as a 
finite sum, which generalizes the Gauss sum which appears in the classical 
case k = Q. 

Tate’s thesis, which was written in 1950 but not published until 1967 (as 
Chapter XV of [CF]), presented a new proof of analytic continuation and 
the functional equation, using the language of adéles and idéles. General 
theory now replaces the heavy calculations needed by Hecke, but some at 
least of the ideas remain the same. Tate’s proof is given in the next section. 


A completely different generalization of the L-series (78) was introduced 
by Artin. To explain his motivation, we return to (73). Let K be the field 
of f-th roots of unity. By the Corollary to Theorem 27, we can interpret 
x as a character on Gal(K/Q); and (73) then becomes 


L(s,x) = T] (1—x(428))0*) 


where (423) is the Artin element and the product is taken over all p 
prime to: f. If we assume the assertions in §17, we can give an exactly 
analogous interpretation of (78) in terms of a certain abelian extension 
K/k. What Artin did was to consider extensions K/k which are Galois 
but not necessarily abelian. Instead of a character x it is now necessary 
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to use a representation p of Gal(K/k) — that is, a homomorphism p : 
Gal(K/k) + GL(W,C) where W is a finite dimensional vector space. If p 
is a prime ideal in k unramified in K/k and $ is one of the primes of K 
above p, then the associated factor in the Artin L-series is 


{det {I — p({444]) (Norm p)-*}}7}, 


which does not depend on the choice of % or of p within its equivalence 
class. We have still to take account of ramified primes. If p is ramified in 
K/k then the Frobenius element [AA] is a coset of the inertia group Ty; 
so the space it naturally acts on is W7, the subset of W whose elements 
are invariant under the action of T via p. Thus the L-series which we are 
led to consider is 


L(s, p) = Il, {det {7 - (Norm p)~*p((444}) acting on wry” ; 


This is the Artin L-series. 
For any o in Gal(K/k) the characteristic roots of p(a) have absolute 
value 1; so the product is absolutely convergent in R(s) > 1. Clearly 


L(s,p1 + p2) = L(s, pi) L(s, p2); 


and if p is the principal representation (for which dimW = 1 and every 
p(c) is the identity) then L(s, p) = ¢,(s). It is known that L(s,p) can be 
analytically continued to the whole s-plane as a meromorphic function and 
that it satisfies a functional equation of the usual type. It is conjectured 
that if p is irreducible and non-principal then L(s, p) is everywhere holo- 
morphic. The importance of this conjecture is that it would imply that 
¢x(s)/Ck(s) is holomorphic for any algebraic number fields K,k with K 
Galois over k. 


The expression for ¢x(s) as a product tells us how rational primes p 
factorize in K. Suppose for convenience that K is normal over Q and let 
L be any field intermediate between Q and K. If we know how p factorizes 
in K then Theorem 16 tells us how it factorizes in L. Since there are only 
a very limited number of possibilities to consider (especially if we confine 
ourselves to unramified primes, as in fact we can), we can reasonably hope 
to have product relations between ¢x(s) and the various ¢,(s) including 
¢q(s). All this is more easily understood from the example to be found in 
§12. For the general theory, see [FT], Chapter VIII.7. 
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In this section + will denote the Haar measure on V,, normalized so that 
ut(V./k) = 1, and pu will be a Haar measure on J,; we shall defer for 
as long as possible introducing the factor which comes from the particular 
normalization of 4 which we chose in §A1.3. We now define a Tate zeta 
function; we shall see later how it relates to Hecke’s zeta function. Let y 
be a continuous character on J, which is trivial on k*; we shall call such a 
character a Tate character. Let f : V; — C be a function so well behaved 
that all the formal manipulations which follow are valid. By analogy with 
the Fourier transform on J; we define 


Ciea= i; Ha)x(a)|laxl|°dy, (80) 


where we require f(a) to die away so rapidly as ||a|| > co that the integral 
converges absolutely in some right hand half-plane Rs > oo. This formula 
defines an analytic function of s in this half-plane; our main task is to 
continue it to the entire s-plane. For this it is crucial that y is trivial on 
k*. 

We divide J, into two parts: J* on which |/a|| > 1 and J< on which 
|a|| < 1; we can ignore the part with ||a|| = 1 because it has measure 
0. The integral over J* is absolutely convergent and defines an analytic 
function for all s, since these properties hold for Rs > oo; it is the integral 
over J< which we have to worry about. Let S be a fundamental domain 
for the action of k* on J< and write J< = UES where € runs through the 
elements of k*; then 

[sertariattau = > ff tee)x(a)Ieui*au 


€ in k* 


= [£5 X Hee) x(a)latay— 0) ? x(q) |la|*dp. 


éink 


In the first term on the right we apply the corollary to Theorem A6 (the 
Poisson Summation Formula); since ut (Vi../k) = 1, we can replace the term 
in curly brackets by 


Hel De in yfla8)- 


Write 3~' for a and note that in an obvious notation du(@) = du(a) and 
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S~} is a fundamental domain for the action of k* on J>. Thus 


[ | 3 foe xed [ | 3 joo} xo 


€ink €Eink 


5 ye D» 130} xe 090i 


€ink 


Defining { by %(8) = x(G~') and interchanging integration and summa- 
tion, 


; fla)x(a)[lexl|*du + F (0) [ x(a) [herd 
J< Ss 


: : (81) 
= | AAR @Iar*du+ fof earNaI-ray 
J> fe 


Let us further assume that f(@) dies away so rapidly as ||G|| + co that 
¢(f, %, 8) also exists for Rs large and positive; then the first integral on the 
right exists for all s. 

We must now look at the second term on each side of (81). Suppose first 
that x is not trivial on Jj, and choose in J} with x(y) 41. Writing ay 
for a, we obtain 


a x(a) |lau|du = x(7) i. x(a)|lexl|*dy; 
s s 


so this expression must vanish. Hence in this case the second term on the 
left in (81) vanishes, and the same is true of the second term on the right. 
On the other hand, if x is trivial on J} then the value of x(a) depends 
only on ||a|] and hence x(a) = ||a|° for some pure imaginary c, since these 
are the only characters on J,/J} + R*+. Here we can set c = 0, which is 
equivalent to absorbing c into s. Using the decomposition J, = J} x (0, co) 
by which we defined the induced measure on J}, we obtain 


pw (Jp /k*) 
8 


8 1 1 * gat 
[ie du = wi(ap/e) f po 
s 0 


where yu! is the induced measure on J}. The other similar integral is equal 
to p)(Ji/k*)/(s — 1). Thus we finally obtain 


Theorem 28 Assume that if x is trivial on J} then it is normalized so as 
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to be trivial on J,. Then under suitable conditions on f, 


chixs) = ff {rla)x(a}llal’ + fla)R(a) hall") 


+ (@ - £01) pi (IE/R) 


(82) 


where the last term only occurs if x(a) is trivial. The right hand side is 
analytic in the entire s-plane except for the possible poles at s = 1 and 
s = 0 arising from the last term on the right. Moreover 


Chi 8) =¢(f,%,1 = 3s), (83) 


The last sentence follows immediately from the main formula, bearing in 
mind that we have identified vy with V., so that the transform of f (a) 
viewed as a function on Vi, is f(—a). 

Note that (82) is homogeneous in the Haar measure p on Jz, so that 
we have not yet committed ourselves to a particular normalization of py. 
Subject to this, the value of y}(J}/k*) can be read off from Theorem 23. 
The reader will have noticed that we have not used the fact that y is a 
character, but only that it is multiplicative; so we could have taken x to 
be any quasi-character J, — C* trivial on k*. But this apparent extra 
generality is spurious. For since Jj/k* is compact, x induces a character 
on Jj; and since the kernel of the map @ + |\a| is J}, we must have 
|x(a)| = |la||¢ for some real c. Now replacing x by the character a + 
|al|-*x(a@) is equivalent to replacing s by s —c. 


To make use of these results, we need to evaluate the integrals which 
define f and ¢; and in any case we need to ensure that our hypotheses on 
f and x can be satisfied. It is natural to choose f and x to be products of 
functions defined on the factors ky and k3 respectively: 


f() =T[ AG), x(a) = [[ xe). 


The easy way to ensure that these products converge is to require for almost 
all v that f, = 1 on o, and x, = 1 on o%. If we also require f, to vanish 
outside o, for almost all v, then 


fe= [ seaelenidu = T] fle) 


for the integral over Vi, is equal to the product of the integrals over the k, 
and c(a) = [J (ay). 


15 The functional equation 87 


A similar argument gives the corresponding result 
(fixes) =T] [ fol@o)xo(an)lauliSdi, (84) 
v wv 


where now the measure on J, is the product of the measures on the k*, as 
in Theorem 23. For if v is p and f, vanishes outside o,, the corresponding 
factor is 


Cp(fps Xp. 5) = a fp(p)Xp(%p) Ilo llp dup. (85) 
n=0 m\prtt 


Here the n-th term is the product of (Normp)~"* and a coefficient inde- 
pendent of s. If the formal product of these series is absolutely convergent 
and so are the remaining integrals in (84), then the non-zero terms of the 
product are precisely the ones we need for an integral over J}. Here we 
treat the product over the non-Archimedean valuations as a Dirichlet series 
and therefore only retain the corresponding terms in the product; this is 
the same convention which we used to expand the Euler product in (71). 

We now evaluate the local factors for those primes p for which yp = 1 
on of. We choose 


_ f 1 for & in op, 
fo(&) = { 0 otherwise. 


If 0, denotes the local different for k/Q at p then as in §A2.2 


(86) 


= ‘= a4 
Br vine + _ § (Normd,)~'/? for np in 05%, 
Hult) = ff expt2xing éom)aug = { SOMO Oa 
because the integrand is a character on 0p which is trivial if and only if np 
is in 0, '. By hypothesis xp(ap) depends only on ||ap||, and must therefore 
have the form 
Xp(Qp) = ll%p ll" 


for s) pure imaginary; here s; may depend on p. Using (85), the corre- 
sponding local factor of the zeta function ¢(f,x, 5) is then 


32 (xp(7))" (Norm p*)~* = (1 — xp(7)(Norm p)~*)~? 
0 


where 7 satisfies p||r. Similarly, if p"||, the local factor of C(f, %, s) is 
(Norm 0, )*~"/? x(x" )(1 — Xp(")(Norm p)-*)?. 
Let S be a finite set of places which includes the Archimedean places and all 
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the primes p excluded from the discussion above. Write x’(p) = xp(7) for 
all p outside S and use multiplicativity to extend the definition of x’ to I, 
the group of those fractional ideals in J;, whose factorizations do not involve 
any prime ideal in S. We shall see in Lemma 34 below that the characters 
x’ obtained in this way are precisely the Hecke Gréssencharakters, and 
L(s,x’) below is the Hecke L-series. Thus up to finitely many factors 
corresponding to the places in S, 


C(f,x,8) = []@-x°(p)(Normp)-*)™?, 


where by analogy with (78) it is natural to call the right hand side L(s, x’). 
It follows from (83) and the expressions above for the local factors that 


. L(1 - 8x") _ x’ (07)(Norm 2d. ye- TT Cul (for Xu $ s) (87) 
L(s, x’) 5G,(fs, Xo 1-8) 
where 0, is the product of the 0, for p not in S. In other words, we get a 
functional equation for L(s, x’), provided that the local characters x, have 
been chosen so that x = [] Xv is trivial on k*. To satisfy this condition, 
we need to see what local characters yy are available. 

Suppose first that v comes from a finite prime p. The 1 + p™ are a 
decreasing sequence of small subgroups, in the sense explained on page 
126; so Xp is trivial on those which have m > mp, where we take m, to 
be as small as possible. We call p™» the conductor of the restriction of 
Xp to of; that restriction is induced by a character on the finite group 

o/(1+p™»), and we are free to select any such character as yp. Now fix 
an element x in k* with pila and write a, = 7”8, where (, is in of; if we 
define s, by xp(7) = ||7||p” then the most general x, is given by 


Xp(@p) = Xp (Bp) lop llp” 


Here s, is really an element of i1R mod (277/ log ||7||p) rather than of iR. 
Next suppose that v is real. Since the characters on (0,00) are just the 
xz —» x*» for some pure imaginary s,, there are just two classes of characters: 


Xu(2) =|2|* or xv(x) = [z|*signz. (88) 


Finally, suppose that v is complex. Let z = re’; then by considering the 
restrictions of x, to 9 =0 and to r = 1 we obtain 


Xv(z) =r? ei? (89) 


for some pure imaginary s, and some integer n,. The effect of increasing 
all the s, by the same constant c is to multiply x(a) by |lal|°, which is 
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equivalent to a translation on s; but in general there is no obviously natural 
way of choosing c. Also, replacing x by y~! reverses the signs of all the sy. 

We have still to ensure that x = [] xz is trivial on k*; we achieve this 
by forcing x to be trivial successively on roots of unity, on of modulo 
roots of unity, and on k*/oj. First choose the presence or absence of the 
factor sign z in (88) for each real valuation and the value of n in (89) for 
each complex valuation. Choose also a finite set S of non-Archimedean 
valuations (which will be the ones for which x, is non-trivial on of) and for 
each of them choose a character xp on of, subject only to the condition that 
x(w) = 1 if w generates the group of roots of unity in k*. The conductor 
of x, denoted by f, will be the product of the local conductors p™» defined 
above. We shall show that for each set of such choices there are exactly h 
admissible characters x, where h is the class number of k. 

The values of s, for the Archimedean valuations are determined up to 
the addition of a common constant by the condition that y is trivial on 
a base for of modulo roots of unity; for this imposes r; + rg — 1 linear 
conditions on these s, whose determinant (with some abuse of language) 
is the regulator. These equations have the form }> a,,Sy, = ib, where the 
Gyv,6, are real; so we can take the s, to be pure imaginary. In view of 
the remarks after (89) we can assume a fixed choice of these values. Now 
denote by Jf” the group of idéles 8 = [] G, such that , is in of for each 
p; thus J‘ is both the kernel of the map J, — J, and the set of idéles for 
which the choices above have already determined the value of yx. There is 
an exact sequence 


0 of — k* x JPY Ip 3 C, 0, (90) 


in which the middle map is a x 8 + a7! and the one following it is 
J — Ip — Cy. Define x on the image of ¢ by x(¢(a x B)) = x(B) where 
@ is the middle map in (90); x is well-defined because @ is determined up 
to multiplication by an element of of and we have already arranged that 
x is trivial on of. Using Theorem A1, we see that x can be extended to a 
character on J, in just h ways; and it is easy to see that these are precisely 
the characters on J, which extend the restriction of x to J*'” and are 
trivial on k*. 

To express the condition which defines a Hecke Gréssencharakter w for an 
algebraic number field k requires an integral ideal f of k, a pure imaginary 
number 8, for each infinite place v of k and a rational integer n, for each 
complex place v. Let S be the set of prime ideals which divide f and denote 
by I® the set of fractional ideals in k whose prime factorization involves no 
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ideal in S. A character w on IS is called a Hecke Gréssencharakter if 


#((a)) = {T] loval*} {TT ver/tova)-™ } (91) 


whenever a = 1modf and g,a > 0 for all real embeddings a, : k — R. 
In (91) oy is the embedding k — C, the first product is taken over all 
infinite places and the second over all complex infinite places. For given f 
we are free to choose the n,, but there are considerable constraints on the 
Sy arising from the fact that ((a)) = 1 whenever a is a unit. 


Lemma 34 The characters x’ derived from Tate characters x are the Hecke 
Gréssencharakters. 


Proof Let x be a Tate character with conductor f, and let ny, $y be as in (88) 
and (89). If a is as above then yp(a) = 1 for pin S, sol = x(a) = [] xu(a) 
implies 


x°((a)) 7 II, not in 5X0) = {TI} 


where the right hand product is over all Archimedean v; and this is (91). 

Conversely, suppose that w satisfies (91). We shall require y to have 
conductor f, so that xp for p not in S is completely specified by the require- 
ment xp(7) = ¥(p). Since x’ only depends on these x»(7), this ensures 
that x’ = wp if we can construct x at all. If v is Archimedean then xy is 
specified by (88) and (89) where n,, s, are as in (91), except that we have 
still the choice between the two alternatives in (88). By the argument of the 
previous paragraph we have ensured that x(a) = 1 for all a = 1 modf such 
that @ is positive at all real places. If merely a = 1modf then x(a”) = 1; 
since 


{a = 1mod f}/{a = 1 modf and a > 0 at all real places} ~ {+1}", 


there is just one way of making the r, choices (88) so that x(a) = 1 for all 
a = 1modf. Similarly there is then just one way of choosing Xp on of for 
each p in S so that x(a) = 1 for all a which are units at p for each such 
p, and finally just one way of choosing the xp(7) for p in S to ensure that 
x(a) = 1 for all a in k*. 

All that is left to do is some tidying up. It follows from (87) that the 
quotient 


ace Xo 1 — 8)/Cv( fo. Xv, 8) 


does not depend on the choice of f,. This can be proved directly, because 
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it is equivalent to 


Cu( fs Xvs 8)Cu(Gor Kur 1 — 8) = Cu(gus Xu, 8)Cu( for Xo 1 — 8) 


for any admissible functions f,,g, on ky. The left hand side here is 


| | fol) (B)Xv(08-?)|faB Is [Bll dadus 
ks Jk 


=| { [ tloreterlaty tua} oT DNB" dp 
on writing 3 = ay. Since |lal|ydua = Avdut for a constant , whose value 
is unimportant, the expression in curly brackets is 


xf H@) ih ao Beter8) aut au 


where c, is the additive character which appears in the Fourier transform. 
This last expression is clearly symmetric in f, and gy. 

Nevertheless, we should choose these f, to make everything as simple as 
possible. If v is real the best choice for f, is 


_f exp(-mz") if x(z) = |x|", 
fo(a) = { rexp(—mz”) if x(x) = |z|*signz, 


where the reader is warned that 7 denotes 3-14159... when discussing 
infinite places. Now f,(z) = f,(x) in the first case and f,(z) = if,(x) in 
the second. Indeed these are just the identities 


fo 0) 
i exp(—my? + 2nizy)dy = exp(—7z’), (93) 
—co 
CO 
/ yexp(—ry” + 2xizy)dy = iz exp(—xz"), (94) 
—0o 


where (93) is standard and (94) is obtained by differentiating with respect 
to z. The corresponding factor ¢,( fv, Xv, 8) reduces to the standard integral 
for the Gamma function; its value is 1~(*+$~)/2— (3(s + sy)) in the first 
case and a~(¢t#e+))/27(3(s5 + sy + 1)) in the second. Similarly the value 
of Gh. Xv, 1-8) is obtained by writing 1— s for s and reversing the sign 
of s,, and also multiplying by 7 in the second case. 

If v is complex then we write z = x + iy and in an obvious notation take 


pee (x — ty)!" exp(—2n(z? + y?)) if n>0, 
om" Lo (2 + ty)" exp(—2n(2? + y?)) if n<0. 
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I claim that this gives fon(z) = il"lf, _,(z). The minus sign on the right 
ensures that ¥(re*?) = r~2%ve~*"9_ For n = 0 the assertion is that 


oO fo) 
7 i exp(—2a(u? + v?) + 4ai(zu — yv))du dv = $ exp(—2m(x? +y7)), 
-00 J —00 


which is just the product of (93) with itself. The general case is now 


obtained by applying th 24:2)" i 2-i2\" 
y applying the operator dz +13, ifn 20, or ( 5 1 


ifn <0. The factor Cy(fy, Xv 8) is (24)1~#-#FI"I/2T(s + 5, + 3|n|). The 
value of ¢,( fos Xv, 1 — s) is again obtained from this by writing 1 — s for s 
and reversing the sign of sy. 

The remaining cases are when v is non-Archimedean and p|f. Now we 
can no longer define f, by (86) because ¢,( fp, Xp, 8) would vanish if we did 
so; there is no obviously best choice for f,, and we shall take 


flax ci(z) = exp(27i Tr, /Qp(Z)) for z in Oe fa 
P 0 otherwise 


in the notation of Lemma A6, where f, is the conductor of x, as on page 
88. Thus 

Bis de. (Normd,)!/2Normf, if y = 1mod fp, 
foto) = [,er-vle) aut ={ : ean 


by Lemma A5. If for convenience we define m,r by p™||fp, p”||Op, then 


co 


GUinxe8)= > tpl f 
a 


n=—m-r Pp 


wear (UF) X0(Z) dts 


I claim that every term in this sum vanishes except the first. 
Suppose first that n > —r, so that ci(z) = 1 on p” \ p®*!. Writing 
xz =n"y, the integral becomes 


Foe) te = x00) | x00) dy = 0 
p\pr+1 07 
because the restriction of xp to of is non-trivial. 
Now suppose that —m —r <n <—r. We write p” \ p*+! as a union of 
sets Z9 + p"” = zo(1 +p"). On such a set ci(z) = ci(z9), so 


a c1(£)Xp(z) dus = Ser(20)xo(a0) | Xp(y) dy; 
p™\pr+} Zo Ap peree 


but 0 < —n~—r <M, so xX, is a non-trivial character on 1 + p~"~" and 


15 The functional equation 93 


each integral on the right vanishes. Thus finally 
Col fps Xp. $ a Ss 
ae = (Norm f,2p)*~!/?xp(m)""""W, (xp), (98) 
plJp» Xp. 4 — 


where writing z7~™~" for z in the integral gives 
Woe) = (Norm fy)? f cx(er-™")xp(t) due (98) 
% 


If, copying the earlier argument, we write x = e(1+y) where € runs through 
a set of representatives for of /(1+p™) and y is in p”, then we obtain 


Wo(Xp) = (Norm fp)~/?'S Vex (en) xp(€) 
which can be described as a generalized local Gauss sum. The canonical 
property 
|W,(xXp)| = 1 (97) 
can be proved as in §13.1, or we can argue as follows. On the one hand, the 
factor associated with p on the right of (87) is independent of the choice of 
fp. If instead we use gp = f, then 9,(z) = f,(—z); so this factor is equal 
to 
ms -1 
Go(9p»Xp»8)__  ¢_4) ) Spl for Xp 1 — 8) 
am = X»(-1) 4 = 
Cp( fp, Xp» 1 — 8) Cp( fps Xp» 8) 
= xp(—1)(Norm (0pfp))*~/?Xp(m™*")/Wo (Xp) 
by (95), where the factor xp(—1) comes from the need to write —z for x 
in the integral (96). Now set. s = 5 and write Xp instead of xp; comparing 
the last result with (95) we obtain 
W,(xp)Wp(Xp) = Xp(—1). 
On the other hand, it follows from (96) on writing —zx for z that 


Wo(xp) = Xp(—1)Wp (Xp) = Xp(—1)Wp(Xp)- 
These two results together give (97). 

We can now finally write down the functional equation in full detail. We 
again define A(s, x) by (79), though now I(s, x) is the product of factors 
for each infinite place which are 

n(+se)/2P(1 (5 + Sy)) for the first case in (92), 
a (s+8ut1)/27(1(5 4 5 41)) for the second case in (92), 
2 


(27)~8-# "1/27 (5 + sy + 4|n|) if v is complex. 
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Combining all our previous results now gives 


A(L—s,x*) 
TS Macphee Wo (xp) 
A(s, x) J. 
where ¢ is the number of real places for which y,(xr) = |z|*sign z. Q 


16 Density theorems 


Euclid proved that there are an infinity of rational primes. Dirichlet proved 
that any congruence class modn which is prime to n contains an infinity 
of primes; and a much more general theorem of Cebotarev will be found 
below. On the other hand, it has not been proved that there are an infinity 
of n for which n? + 1 is prime, or for which n and n + 2 are both prime. 

Most proofs that there are an infinity of primes having a given property 
depend on showing that the set of such primes has positive density. There 
are two standard measures of density even for sets of rational primes, and 
they both generalize to an arbitrary algebraic number field k. Let S be a 
set of prime ideals of k and So the set of all prime ideals of k. The most 
common definition of the density of S is 


lim x -00(N(S, X)/N (So, X)) (98) 


provided this limit exists; here N(S,X) is the number of prime ideals p 
in S which satisfy Normp < X and similarly for N(So,X). The prime 
number theorem for k (which we shall not prove) states that 


N(So,X) = ext +0(1)). 


The alternative measure is the Dirichlet density, which is defined as 


a os in =(Normpy-*/S0, in as(Normp)*} ” 


provided this limit exists. In order not to have to prove analytic con- 
tinuation, it is usual to take the limit as $s tends to 1 from above. The 
denominator in (99) is asymptotic to —log(s — 1), as can easily be seen 
from (74) and (75). 

Whether we work with (98) or (99), the density of S depends only on 
the first degree primes in S — that is, the prime ideals p for which Norm p 
is a prime in Q; for it is easy to show that the set of all primes in k which 
are not first degree has density zero. 

If the limit (98) exists so does the limit (99) and they have the same value; 
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see Exercise 4.2. But (99) can exist even when (98) does not, though such 
cases are probably pathological. More to the point, (99) is usually much 
easier to prove than (98); for the natural line of attack on (98) requires 
a knowledge of the behaviour of the numerator of (99) on the whole line 
Rs = 1. For applications, what is normally needed is to know that the 
density of S is strictly positive; but it does not matter which density one 
uses. Each of the three theorems in this section is stated and proved for 
Dirichlet density, but remains true if we replace Dirichlet density by density 
as defined by (98). 

The standard way to prove a density theorem appears in its simplest 
form in Dirichlet’s theorem on primes in arithmetic progression. 


Theorem 29 Let m > 2 and let c be a fixed integer prime to m; then the 
rational primes which satisfy p= cmodm have Dirichlet density (¢(m))—! 
where @ is Euler’s function. 


Proof Let k = Q( V1) and let x be a character on Gal(k/Q); by the 
Corollary to Theorem 27, x induces a character on (Z/mZ)* which we 
shall also denote by x. Suppose that x is not principal. In the notation of 
(73) 


L(s,x) = yy Sa(n ~(n+1)~*) where Sp = Yo x(r). 


But n-* — (n +.1)~* = O(n7!~?) where o = Rs, and S,, = 0 for any 
integer v by Lemma A5, so that S, is bounded; so L(s,x) is analytic in 
Rs > 0 and in particular at s = 1. Again, if p has order f as an element of 
(Z/mZ)* then x(p) runs through the f-th roots of unity as x varies, and 
x(p) takes each value equally often; so 


Il, — x(p)p~*) = (1 — p48) 9Or)/F, 
Using the Corollary to Theorem 27 again, it follows that 
Ck(s) Il, (s, x) (100) 


up to finitely many factors corresponding to the primes dividing m. Here 
¢.(s) has a simple pole at s = 1 and so has L(s, xq) where xo is the principal 
character; for L(s, xo) is ¢q(s) up to finitely many factors. All the other 
factors on the right in (100) are analytic at s = 1, so (100) implies 


L(l,x) #0 for x# xo. 
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Since log(1 — x(p)p~*) = —x(p)p~* + O(p-?7) where o = Rs, we obtain 


log L(s, x) = }_ x(p)p~* + O(1) 
in Rs > 1. Thus 


(mm) cemodm? | = Dap day X()X(P)P* 
= XC) log Ls, x) + 001). 
A similar argument gives 
log L(s, x0) = log Gq(s) + O(1) = > p~* + 0(2), 
Combining these two equations with (100) gives the theorem. QO 


The same ideas work for an arbitrary algebraic number field k, but we 
have to appeal to much more sophisticated results. We now use the notation 
of §17. A weaker version of the following result will reappear as Theorem 
35. 


Theorem 30 Let k be an algebraic number field and A/H a congruence 
divisor class group in k. Let C be a coset of H in A. Then the Dirichlet 
density of the prime ideals in C is N~!, where N = card(A/H). 


Proof Let K be the abelian extension of k which is the class field for A/H 
in the sense of Theorem 32, and let x run through the characters of A/H. 
If xo is the principal character then L(s, xo) is equal to ¢,(s) up to finitely 
many factors; so L(s, xo) has a simple pole at s = 1. Any other character 
x satisfies x((a)) = 1 whenever a = 1 mod f and o,a > 0 for all real places; 
so by Lemma 34 it comes from a Tate character which is clearly non-trivial. 
Hence L(s,x) is analytic at s = 1, and using Theorem 37 instead of (100) 
we again obtain L(1,x) # 0. The rest of the proof follows that of the 
previous theorem. oO 


The isomorphism between A/H and Gal(K/k) given by the Artin symbol 
implies that we can identify the set of prime ideals in C with the set of 
prime ideals p for which the Artin symbol (4) takes a given value. But 
this is only a special case of a more general result, the Cebotarev Density 
Theorem, for which we only need to assume that K/k is Galois and not 
necessarily that it is abelian. 


Theorem 31 (Cebotarev) Let K,k be algebraic number fields such that 
K is Galois over k, let o be an element of Gal(K/k) and denote by (co) 
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the conjugacy class of 0. Let S be the set of prime ideals p of k such that 
for every % above p the Frobenius element [ae lies in (oc). Then S has 
Dirichlet density card((o))/card(Gal(K/k)). 


Proof Let f be the order of o and L the fixed field of o, so that Z is 
the splitting field of $$ whenever (AA) =o. The Dirichlet density of the 
primes q in LZ with (x4) =o is f—!, by the remarks after the proof of 
Theorem 30. If p is in S its prime factors are the distinct 73 with 7 in 
Gal(K/k); 7 fixes B if and only if 7 is in the cyclic group generated by o, 
and (AGI = (A) if and only if 7 is in Z(c), the centralizer of o. Hence 
the number of distinct $8 above p with (AF) =o is f~lcard(Z(c)); and 
the Dirichlet density of S is 
f=} - 1 ___ card((o)) 
folcard(Z(c))  card(Z()) —_— card(Gal(K/k)) 


because there is a one-one correspondence between the P with (ASI =o 
and the q with (AZ) =o, and Normq = Norm p. QO 


5 


Class Field Theory 


17 The classical theory 


Let k be an algebraic number field and K a finite abelian extension of k. 
The objective of classical class field theory, which was largely achieved, was 
to describe the properties of K in terms of objects in k. The theory was 
first formulated in the 1890s, partly by Weber (following Kronecker) and 
partly by Hilbert (following Kummer); but one crucial component was only 
provided by Artin in 1927. The first proofs were given by Takagi in the 
1920s; he used complicated group-theoretic arguments which we now know 
to belong to group cohomology — a subject which at that time had not 
been invented. 

Let k be an algebraic number field with class number h > 1. It is 
straightforward to show that there are algebraic number fields K D k such 
that every ideal in k becomes principal in K. Is there a canonical way of 
choosing K, and what additional properties will the extension K/k have? 
Hilbert conjectured that there is just one such field K with the following 
additional properties: 


(i) 0% = (1), so that the extension is unramified at all finite places; 

(ii) the extension is also unramified at all infinite places, so that the places 
of K above a real infinite place of k are all real; 

(iii) K is abelian over k with Galois group isomorphic to the ideal class 
group I, of k. 


He called such a field the absolute class field of k, and proved its existence 
in certain cases, for example when h = 2. This conjecture was reinforced 
by one of the key tools in Kummer’s work on Fermat’s Last. Theorem: if k 
is an algebraic number field of class number A and K an abelian extension 


98 
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of k with 0«/, = 1 and Gal(K/k) ~ Z/(p) where p is prime, then p|h and 
there are just p ideal classes in k whose members become principal in K. 

It had been shown by Kummer that the cyclotomic fields (the subfields of 
Q(e) where ¢ is a root of unity) were abelian over Q and that how a prime 
p factorized in such a field depended only on congruence properties of p. 
Kronecker conjectured the converse of this: that every abelian extension 
of Q is a cyclotomic field. A nearly complete proof was given by Weber, 
though it took a further generation before all the gaps in his argument were 
filled in. The result is however an easy consequence of the general theory, 
as well as being a major inspiration for its formulation. 

The examples obtained in these ways led Weber to the general ideas of 
a congruence divisor class group and of a class field. To define the 
former, let m be the formal product of an integral ideal in k and possibly 
some real infinite places in k; thus we can formally write m = [] pu” where 
some of the p, may be infinite real places. Denote by A, the subgroup of 
I, consisting of the fractional ideals whose prime factorizations involve no 
prime ideal dividing m; here we only take account of the finite primes p, in 
m. Let H® be the subgroup of Ap consisting of those principal ideals which 
can be written as (a) where a = 1modm — that is, a = 1modp," in kp, 
for each finite p, and ca > 0 if p, is a real infinite place and 0: k — R 
is the corresponding map. In this way there corresponds to any m the 
quotient group Am/H®, which is easily seen to be finite. In what follows, 
we shall denote by Hm any subgroup of A» which contains H°. 

Suppose that m|n in the obvious sense; then A, > A, and H® > H°. 
Moreover, Hy = Hm An D He? for any Hm; and since each coset of H® in 
Am meets A, there is a canonical isomorphism 


An/Hy © Am/Hm. 


Let m1, m2 be two such formal divisors with associated groups Hm,, Hm,- 
We shall say that these two groups are equivalent if for some common 
multiple m of m,, m2 (and thus for any common multiple) 


Am 1 Hm, = Am Hing: 


It is easy to show that this is an equivalence relation Hp, ~ Hm, and that 
the quotient group Am/Hm does not depend on the choice of m. We call 
the equivalence class of such quotient groups a congruence divisor class 
group A/H, and we call the least modulus m for which it can be realized 
the conductor f of A/H. 

A finite algebraic extension K of k is called a class field for A/H if the 
prime ideals p in k which split completely in K (that is, those which have 
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€x/k = fx/k = 1 for each prime of K above p) are precisely those which 
belong to H. For the real infinite places in f this needs an elucidation: we 
require that the extensions to K of a real infinite place in k are all complex 
if it is in f and all real otherwise. We can now state the key results of the 
classical theory. 


Theorem 32 Every abelian extension K/k is a class field for some A/H. 
Conversely, there corresponds to each congruence divisor class group A/H 
in k a unique class field K/k, and K is abelian over k. 


In stating the next two theorems we shall denote by K the class field 
corresponding to A/H in k. 


Theorem 33 The places in k which ramify in the extension K/k are pre- 
cisely those which divide f. Moreover, if x runs through the characters of 
A/H and fy is the conductor of x, then the finite part of |], fy is just 
nOIM x /K0K/k, Where 0x/~ 1s the different of the extension. 


To define the conductor of x we form H*, the union of those cosets of H 
on which x = 1; the conductor of x is just the conductor of A/H!. 


Theorem 34 The Galois group of K/k is isomorphic to A/H. This iso- 
morphism is canonically realized by means of the Artin symbol (Hit) ; the 
map ar (i) is an epimorphism A — Gal(K/k) with kernel H. 


The second sentence is the Artin Reciprocity Law. It was only dis- 
covered a generation later than the rest of classical class field theory was 
formulated (and some five years after the rest was proved); but as soon as 
it was discovered it was recognized as the central result of the theory. It 
contains all previously known reciprocity laws. 


Corollary Let p be a prime ideal in k not dividing f. Then p is a product 
of g prime factors B in K, each with norm /,B = p/ where fg = (K/k) 
and f is the order in A/H of the ideal class containing p. 


These theorems enable one to describe the factorization in K of primes in 
k in terms of objects in k. To do this, we need to find the H associated with 
a given abelian extension A’/k. Since we know the conductor of H (or, more 
easily, the primes which divide it) by Theorem 33, there are only a small 
number of possibilities for H; and all but one of these can be eliminated by 
considering the factorization of a few small unramified primes. (Compare 
Exercise 5.1.) Implicit in Theorem 34 is the generalization of Dirichlet’s 
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theorem on primes in arithmetic progression; the deduction of this from 
Theorem 34 can be found in §16. 


Theorem 35 Each coset of H in A contains an infinity of prime ideals. 


As the theory developed, it became clear that Hilbert’s original question 
was somewhat peripheral. But the theory does provide an answer to it, as 
follows. ‘ 


Theorem 36 Let H be the group of principal ideals in A. Then f = (1) 
and the class field for A/H is the absolute class field in the sense of Hilbert. 


These theorems do not however tell one anything about the group of units 
in K or the class number of K; and broadly speaking it is only when one 
can construct class fields explicitly that one can obtain any information 
about these. For arbitrary k, the construction of K from H is the one 
major unsolved problem in the theory; but it should be remembered that 
in mathematics not all problems have solutions. The simplest examples of 
what is known are as follows. 

According to the Kronecker-Weber Theorem, the abelian extensions of Q 
are just the cyclotomic fields — that is, the subfields of some Q(e), where 
€ is a root of unity. Analytically « = exp(2mim/n) for some m,n in Z, so 
that € can be regarded as the value of the periodic function exp(z) at a 
division point — that is, at a value of z which is a rational submultiple of 
its period 27%. In the 1890s elliptic modular functions were a major growth 
industry (as they have now become again); and the work of Kronecker and 
Weber, and of their students, strongly suggested that all abelian extensions 
K of complex quadratic fields k could be generated by values of certain of 
these functions at distinguished points, and that if p is a prime ideal in a 
complex quadratic field k then the factorization of p in K depends only on 
congruence properties of p in k; but a complete proof of these assertions 
had to await the work of Takagi. These results have been generalized to 
other (but still rather limited) types of algebraic number field k by Shimura 
and others, using the theory of complex multiplication of Abelian varieties 


Some of the results above can be put into analytic form, in terms of 


L-series and zeta functions. (There was a period when strenuous attempts 
were made to rid class field theory of all taint of analysis, but the pen- 
dulum has now swung back and the importance of the analytic aspect is 
recognized.) Let s be a complex variable, and take Rs > 1 so that all 
problems of convergence are trivial. Let x be any character on A/H with 
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conductor fy. We can lift ~ back to a congruence character mod f,, that 
is, a homomorphism 


x: Ag — {complex numbers of absolute value 1}- 


whose kernel contains AR . Set x(a) = 0 for all integral ideals a not in A; 
and write 


7 x(a) x(p)_\~ 
L(s,x) = >> (Normaye I] ( z at) 


where the sum is over all integral ideals a in k and the product is over all 
prime ideals p in k. This is the classical (Dirichlet) Z-series; the equality 
of sum and product is just the Unique Factorization Theorem. In the 
particular case when x is the principal character, so that fy = (1), we 
obtain the Dedekind zeta function 


Ck(s) = ) | (Norma)~* = [Ja — (Norm p)~*)~! 


where the sum is taken over all integral non-zero ideals a and the product 
over all prime ideals p. In this language we can restate Theorem 33 and 
the Corollary to Theorem 34 together in the following form. 


Theorem 37 ¢x(s) = [| L(s,x) where the product is over all characters 
of A/H. 


Theorems 32 to 34 are the main theorems of classical class field theory; 
it was essentially a global theory, since the importance of local class field 
theory did not become apparent till later. Closely linked to the development 
of classical class field theory was the study, largely inspired by Hasse, of 
local-to-global questions. These can appear in two ways, though they are 
not essentially different: 


(i) If a condition is satisfied (for example, if an equation is soluble) in 
every k,, does that imply that it is satisfied in k? 

(ii) Suppose we have a set of objects O defined with respect to k, such 
that the completion map k — k, sends O to an object O, defined 
with respect to ky. Given a collection of O, for v in S, where S 
is some subset of the set of all valuations, is there an O of which 
they are all images? 


In each case, if the answer is negative, can we describe the obstruction? 
The simplest example of a positive theorem which answers a question of 
this kind is the Hasse Norm Theorem: 
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Theorem 38 Suppose that K/k is cyclic and a in k is everywhere locally 
a norm for K/k — that is, a is in the image of the map K @ ky > ky 
induced for each v by completion from the norm map K — k — then & is 
the norm of an element of K. 


If K/k is merely abelian then this result need not be true; for example, 
if we take k = Q, K = Q(V13, V17) then —1 is a norm everywhere locally 
but it is not a global norm. 


18 Chevalley’s reformulation 


It still remains true that for many applications the classical language is the 
most convenient. But there were several unsatisfactory features about the 
original formulation of class field theory. That it only gives information 
about abelian extensions is probably a fact of life; though there is now a 
subject called non-abelian class field theory, it does not provide answers to 
the most obvious questions — perhaps because there are no good answers to 
them. But sometimes (as with Theorem 38) the classical theory only gave 
information about a restricted class of abelian extensions. There were also 
aesthetic objections: to the substantial role which analysis played in the 
original proofs, to the ugly definition of the congruence divisor class group 
A/H and to the complications of the group-theoretic arguments involved 
in the original proofs. 

These were the reasons for a reformulation in terms of homological alge- 
bra. This produced some new theorems which could be stated in classical 
language — for example it provided a description of the group 


{everywhere local norms}/{actual norms} 


which is the obstruction to the obvious generalisation of Theorem 38. (In- 
deed, one now instinctively assumes that all obstructions are best described 
in term of cohomology groups.) In this language the classical theorems be- 
came statements about certain cohomology groups, which remained valid 
for normal extensions which were not necessarily abelian; however, it was 
only in the abelian case that these cohomology groups could be described in 
down-to-earth terms. At the same time Chevalley introduced the concept 
of idéles. In the language of idéles A/H can be replaced as follows. Let 
F be any open subgroup of J, containing k*; by definition FH has finite 
index in J,. Chevalley showed that there is a natural one-one correspon- 
dence between pairs A, H and subgroups H , and that this enables one to 
reformulate most of the main theorems of classical class field theory. 
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This led to a more radical reformulation, which also takes into account 
infinite abelian extensions. Let S be a finite set of valuations which con- 
tains all Archimedean valuations; S is related to, but is not necessarily the 
same as, the set of places which occur in m in Weber’s definitions. De- 
note by IS the group of those fractional ideals which do not involve any 
prime ideal in S in their factorization, and by J® the group of those idéles 
whose components are 1 at each place in S. These are subgroups of I,, J; 
respectively, and J® is closed in J,. The natural map J, — I, already 
introduced on page 50 induces a continuous map J — I*, which we write 
in the form € ++ (€)5. It is also convenient to write F. K/k for the Artin 
map ar (X44), which is a homomorphism IS — Gal(K/k) provided S 
contains all ramified primes. Theorem 34 can now be rewritten in the fol- 
lowing apparently weaker form, in which we fix an abelian extension K/k 
and S is assumed to contain all the primes which ramify in K/k. 


Theorem 39 There exists « > 0 such that Fx /,((€)) is the identity for all 
€ in k* for which |\€ — 1||, < € for each v in S. 


For suitable ¢ these conditions imply € = 1modm, so that Theorem 
39 does follow from Theorem 34. The converse, that Theorem 39 implies 
Theorem 34, is less easy. The key fact is that Fx, is trivial for all ¢ 
which are norms for K/k, and hence (after Theorem 39) for all £ which are 
locally norms for every v in S. Now Theorem 34 follows from local class 
field theory, which is much easier than the global theory. 


At this stage Hecke’s Gréssencharakters enter the picture. These were 
originally introduced as the most general homomorphisms 


x: I5 — {complex numbers of absolute value 1} 


for which Hecke’s proof of the functional equation for the associated L- 
series is valid. One form of the condition on x is (91), as was shown in 
Chapter 4; this is equivalent to the apparently weaker condition that for 
any neighbourhood N of 1 there exists an € > O such that the inverse 
image of NV contains all principal ideals (€) with € satisfying ||f — 1l|. < € 
for each v in S. (See Exercise 4.3.) This led to the idea of an admissible 
map. Let G be any commutative topological group; then a homomorphism 
¢: IS — Gis called admissible if to any neighbourhood N of the identity 
in G there corresponds an e > 0 such that $((€)) is in N for any € in k 
with ||€ — 1||, < € for each v in S. Theorem 39 can now be rewritten in the 
form 
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Theorem 40 The Artin map Fx /, : IS — Gal(K/k) is admissible. 


So far, we have been considering a single abelian extension K/k. If 
instead we consider all finite extensions of k and the corresponding Artin 
maps, it becomes evident that the natural language to use is the language 
of idéles. The detailed translation depends on 


Lemma 35 Suppose that G is topologically complete, and that the homo- 
morphism ¢ : IS — G is admissible. Then there is a unique continuous 
homomorphism w : J, — G such that 


(i) w(€) = 1 for each € in k*, 

(ii) W(€) = 9((€)®) for each & in JS. 

Conversely, let x be any continuous homomorphism J, — G which is trivial 
on k*, and suppose that there is a neighbourhood of the identity in G which 
contains no non-trivial subgroup; then y= comes from some set S and some 
associated admissible map ¢: I5 — G. 


Such a w induces a continuous homomorphism C, = J;,/k* — G where 
Cx is called the idéle class group; we call this map w too. In particular, 
Theorem 40 is equivalent to the existence of a 7 which also satisfies 


W(E) = Ficye((€)°) for all € in J°; 
such a w, which is unique by the lemma, is called the Artin map px/,. 


Now the main theorems of class field theory take the following form. 


Theorem 41 To every finite abelian extension K/k corresponds an Artin 
map py: J, + Gal(K/k). This is an epimorphism whose kernel is precisely 
the open subgroup k*(Normyx;,(Jx)); and it induces an isomorphism 


C,/Normxx(Cx) R Gal(K/k). 
IfL> K Dk with L abelian over k, then there is a commutative diagram 


c, “4 Gal(L/k) 
WK /k 
c, VE Gal(K/k) 


where the right hand arrow is restriction to K. Conversely, if N > k* is 
an open subgroup of J, then there is a unique abelian extension K/k such 
that N is k*(Normx %(Jx)), the kernel of the Artin map Wx/x.- 
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The commutative diagram above enables us to take the inverse limit 
as K runs through all abelian extensions of k. In this way we obtain a 
continuous homomorphism 


We: Ch > Gal(k*>/k) 


where k® is the maximal abelian extension of k and the Galois group has 
the topology proper to an inverse limit — that is to say, a base for the open 
neighbourhoods of the identity is given by the subgroups of finite index. 
The map 7 is onto, and its kernel is just the connected component of C,. 
Unfortunately, for general k very little is known about this kernel. 


19 Reciprocity theorems 


It is natural to ask whether the Quadratic Reciprocity Law (Theorem 25) 
can be generalized to arbitrary algebraic number fields k and to higher pow- 
ers. This is possible, subject to one important restriction on k. Throughout 
this section, m > 1 will be a fixed integer and we shall require k to con- 
tain the m-th roots of unity. Denote by S the set consisting of the infinite 
places of k and those primes of k which divide m; and note that if p is 
not in S then X™ — 1 splits completely in k and therefore in 0, /p, whence 
Norm p = 1 modm. 

For @ in k* let K = k(e) where e™ = a; denote by S(q) the union of 
S and the set of primes at which a is not a unit, and define the power 


for any prime p of k which is not in S(a). If we extend this definition by 
multiplicativity to (2) for all 6 in J5( then (2) is an m-th root of unity 
which is unaltered by replacing e by another m-th root of a. We call (f) 
a power residue symbol because ($) = 1 is equivalent to eNo'™P = emodp, 
hence to emodp lying in o,/p and so by Hensel’s Lemma to @ being an 
m-th power in kp. 

Now suppose that a’ is also in k and that K’ = k(e’) with (e)™ = a’, 
and write L = KK’. By Theorem 16 (44) is the restriction of (424) to 
K and so on; and therefore 


(3) <= (HE) (tt 
(C4 
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where Lo = k(ee’). It follows that 


OG) as 


provided the right hand side is defined. Again (7A )e = €Norm? mod p 
implies 

(2) = a(NormP-1)/™ mod p. (102) 
This determines (3) uniquely as an m-th root of unity; moreover it gives 
an alternative proof of (101), and it shows that if b is integral then (2) 
only depends on a mod b. 

Now let A/H be the congruence divisor class group corresponding to 
K/k. It follows from Theorem 33 that the only finite primes which can 
appear in f, the conductor of A/H, are those in S(a). Let y be in k*; then 
+ is in HT if y is in (k5)™ for each v in S(q), and it follows from Theorem 


(3) -(@)- 


The theory can be given a more symmetric form in terms of the Hilbert 
norm residue symbol. This was introduced by Hilbert in the case m = 2, 
by means of the much simpler definition (105) below, and by Hasse in 
general. The first step is to define the local Artin map corresponding to 
the abelian extension k(¢)/k; because it costs nothing, we shall do this for 
an arbitrary abelian extension K/k. Let G = A/H be a congruence divisor 
class group in k, with conductor f, and let S be the set of prime ideals 
which divide f. Denote by JS = A the group of those fractional ideals 
whose prime factorizations involve no ideal in S, and by J® the group of 
idéles whose components are 1 at each place in S. 


Lemma 36 In the notation above, for each place v of k there is a unique 

homomorphism fy :k§ + G such that 

(i) if p is a prime ideal not in S then fy is trivial on 05, 

(ii) f =[] fo is well-defined and continuous on J, 

(iii) if & is in J then f((€)) is the image of (€) in G, where (€) is as on 
page 50, 

(iv) f(8) =1 for each B in k*. 


Proof If p is not in S we must take f,(ap)) to be the image of p" in G where 
p"|la,; this ensures (i) and (iii), and also (ii) provided the f, for v in S 
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are continuous. Let N > 0 annihilate G and temporarily let the f, with v 
in S be any homomorphisms k* —+ G. Then f({) is the identity provided 
that is in k* and in (k%)% for each v in S, for then Theorem 17 implies 
that we can write 8 = BN B with B in Jp, Bo in k* and G2 = 1modf — so 
that (G2) isin H. The natural map k* — [], ;, ¢(k3/(k3)%) is onto, using 
Theorem 17 again, and it follows that there is just one choice of the f, with 
v in S for which (iv) holds. Moreover these fy are continuous because the 
ks /(k2)% are finite. Oo 

The local Artin map ~, : k* — A/H — Gal(K/k) is obtained by com- 
posing fy, with the isomorphism of Theorem 34. It also follows from the 
corollary to that theorem that if a is a norm for K/k and v is not is S then 
fu(a) is the identity. Using Theorem 17 yet again, the construction in the 
proof of the lemma implies that this holds also if v is in S. 

The Hilbert symbol (a, 3), for a, G in k* is now defined by 


wy (Be = (a, B)v €; 


we shall shortly prove that it extends to a continuous function on k3. The 
value of (a, 3), is again an m-th root of unity, and is unaltered by replacing 
€ by another m-th root of a. It is trivial that 


(a, BB’)» = (a, B)» (a, By 
and an argument like that which was used to prove (101) gives 


(aa, 8)» = (a, B)y (a’, B)v- 


Lemma 37 Each of the symbols (a,—a)y and (a,1— a), is equal to 1 
whenever it is defined. 


Proof Write d = m/[K : k], so that the conjugates of ¢ over k are the (+e 
with d|uz where ¢ is a primitive m-th root of unity. For any y in k, 


y™ -a= TT, 0 — Ce) = norm K /k {3 = coh ; 


so wy(y™ — a) is the identity and (a, y" — a)y =1. Nowset y=0,1. O 


Corollary For any a, in k* we have (a, 8), (8,a)y = 1. 
Proof By bilinearity 
(af, —A8)y = (a, —&)y (8, —B)v (By (8, &)o 
and all but the last two terms are equal to 1. QO 
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It follows that the Hilbert symbol induces a skew-symmetric bilinear map 
ks /(ke)™ x ki /(k5)™ — {m — th roots of unity}. (103) 


For by the definition of y, we can take the second argument to be in k*, 
and even in k*/(k%)™ because the Hilbert symbol is killed by m. But 


BE (0 (B5)) 5 /(5)™ 


is onto, so corresponding to any @ in k* we can find () in k* such that 
(a, 8)y = (a, B1)y for all a in k*. Thus we can use the corollary to extend 
the range of a from k* to kj. 

The other key property of the Hilbert symbol is 


[],(@:4)e =1, (104) 


which follows from (iv) of Lemma 36. The product is well-defined because 
(a, 8)y = 1 whenever a, G are both in 0% and v is not in S. 


Theorem 42 The Hilbert symbol, regarded as a bilinear form (103), is 
non-degenerate. 


This is a key result in local class field theory. For v in S we cannot prove 
it in this book, essentially because of the rather indirect construction of f, 
in the proof of Lemma 36. But if v = p is not in S the result is trivial. 
For suppose @ is in 0} and pl|7; then by construction (a,7), = 1 if and 
only if p splits completely in k(e)/k, which by Theorem 19 is the same as 
saying that q is in (of)™. If p"lla with mln but a@ is not in (kj)™ then 
(a, 7)» # 1; if mn and r is the highest common factor of m and n, then 
(a, B)p #1 if B is in of and not an (m/r)-th power. 


Theorem 43 Let L be abelian over ky; then the group of norms from L to 
ky has index (L : ky] in kj. 


This result again belongs to local class field theory, though elementary 
proofs exist. We quote it because the special case [LZ : ky] = 2 implies the 
important formula (105) below; but even in this case the elementary proof 
is no more than a tedious and unilluminating verification. 

It is reasonably easy to evaluate the Hilbert symbol when v is not in 
S. If v is Archimedean the fact that k contains the m-th roots of unity 
implies that (a, 3), = 1 except perhaps when m = 2, v is real and a, @ are 
both negative; in this exceptional case (105) shows that (a, 3), = —1. The 
non-Archimedean cases are covered by the following lemma. One relatively 
simple case for which v is in S is given in (105), and another can be found 
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in the exercise later in this section. For any particular & the Hilbert symbol 
is given by a table which can easily be constructed with the help of (104). 
For an illustration of this, see Exercise 5.5. 
Lemma 38 Suppose that p is a finite prime of k not in S; then 

(a, B)p = ie) where y= (—1)*(2)*B) q2(8) g-¥(@) | 
Here y is a unit at p, so that the value of (7) is given by (102). 


Proof It follows from the construction of yp that (a, 8), = 1 for a, B in 03; 
and if p||7 then (7, —7),) = 1 by Lemma 37, and (a, 7), = (©) by definition 
if @ is in of. Write a = 1a, B = 1”) 89. Using the results of the first 
sentence and the bilinearity of the Hilbert symbol 

(2, 8)p = (1, Bo)p\* (a0, m)p9)(—1, 2) 9" 
and the result now follows from the Corollary to Lemma 37. QO 


For any a, in k* denote by (3)5(® the ideal obtained from (8) by 
deleting any prime factors which lie in S(a), and write 


(5) - (ae): 


note that despite (101) it is not always true that (337) = (G)()- Then 
we have the general power reciprocity law: 


Lemma 39 If a, 8 are in k* then 


(3)(2)" Tee. 


where the product is taken over all v in S(a) 1 S(f). 


Proof Using the Corollary to Lemma 37 we have 


63) 2)" = {Taal} Teac an) 


= Il (a, B)>» 


p not in S(a)NS(B) 


because the terms which occur in both products are each equal to 1 as in 
the proof of Lemma 38. Now use (104). O 


Now suppose that m = 2, but place no restriction on k. The case m = 2 
is particularly favourable, because in order to give a formula for (a, 3), it 
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is enough to give a necessary and sufficient condition for (a, 8), =1. Such 
a condition is as follows: 


(0,8)y = 1 if aX? + BY? = 1 is soluble in k,, 
yew" | -1 otherwise. 


This follows easily from Theorems 42 and 43. For we can assume that q 
is not in (k*)? and write L = k,(./a). The proof of Lemma 37 shows that 
(a, 8)y = 1 if B is a norm for L/ky, which is equivalent to the solubility of 
aX? + BY? = 1; and Theorem 43 shows that for fixed a the @ for which 
this holds form a subgroup of index 2 in k*. It now follows from Theorem 
42 that (a, 8), = —1 if @ is not a norm. 

What underlies this result is the much more general fact that central 
simple algebras over k can be classified by means of the Hilbert symbol; 
see [Weil], Chapters IX to XI or [CF], pages 137-8. Denote by A the 
quaternion algebra with norm form 


X3 ~ aX? — BX} + apXx?; (106) 


(105) 


then each of the two following conditions is necessary and sufficient for 
A® ky to be isomorphic to the algebra of 2 x 2 matrices with elements in 
ky: 


(i) the quadratic form (106) has a non-trivial zero in ky; 
(ii) (a, B)v = 1. 


It is easy to see that (i) is equivalent to the solubility of aX? + BY? = 1 
in ky. 


Exercise Let m = p be an odd prime and let k = Q(C) where ¢ is a 
primitive p-th root of unity; thus the only finite place in S is p = (7) 
where 7 = 1 —(¢. The object of this exercise is to provide as Lemma 40 a 
straightforward way of computing the Hilbert symbol (a, 8)». 

Write 7, = 1-2" for r=1,2,... , let U, be the group of elements € in 
ky satisfying « = 1 mod” and let Up = 05. Show that 


(i) the image of 7 generates kj /o5, 

(ii) oF = (05)?U1, 

(iii) for each r the image of n, generates U,/U;+1, 
(iv) every element of Up41 is in UP. [Use Lemma 20.] 


Deduce that 1 and m,-.. Np generate kj /(k>)?. This means that we need 
only evaluate (a, 3)p when each of a, @ is 7 or one of m,... , Mp; and this 
can be done by induction from the following lemma. 
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Lemma 40 [f p21, v 21 then 


(Ny, nv)p = (Nu Nu+v)p (Ny+v, Nv)p (x, Nuty)p (mv; ™)5 (107) 
and in particular (Nu, v)p = 1 if u+v > p. Moreover (1,7), =1 and 


jl wfl<p<p, 
(we={ ¢ if p=p. 


Because p is odd, (a,a), = 1 for all a in kj by Lemma 37. For (107) 
apply Lemma 37 to 8 = nv /nu+v, noting that 1 — 6 = m’n,/nu+r, and 
use bilinearity. If u+v > p then n+. is in (kj)? and so all symbols 
involving it are equal to 1. If 1 < pw < p it follows from Lemma 37 that 
(Nus™)p = (Nu, 7")p = 1; so the only difficulty comes with the very last 
assertion, Taking vy = 1, p = p —1 in (107) gives (C,p-1)p = (7, 7p)p 
because each of the other factors is 1. If a = ¢ in (102) the congruence 
must actually be an equality because it cannot be satisfied otherwise; now 
use (104) and Lemma 38 to show that 


(¢,B)p = Il ¢2(8)(Norm q-1)/p 
q 


for any in o*, where q”‘9)||@ and the product is taken over all prime ideals 
q of k other than p. Finally, prove that under the same conditions 


>,0(8) (Norm q — 1) = Norm ({) — 1 mod p” 


and Norm (np-1) = Norm ((1— 7?~1)) =1-—Trn?-!=1—pmodp?. QO 


20 The Kronecker-Weber Theorem 


It follows from the Corollary to Theorem 27 that every cyclotomic field is 
a class field; conversely, Theorem 16 and Theorem 27 enable us to com- 
pute the Artin symbol for any cyclotomic field, and it follows that to each 
congruence divisor class group there corresponds a unique cyclotomic field 
which is a class field for it. In particular, the field of m-th roots of unity 
corresponds to Hmo. But this is not enough to prove that these are the 
only class fields over Q. This assertion is the Kronecker-Weber Theorem 
(Theorem 44). Its proof is elementary but complicated; it exploits many 
of the ideas in Chapters 1 and 2, as well as Theorem 27 and its Corollary. 
The fact that every extension of Q is ramified at some prime (Corollary to 
Theorem 21) plays a crucial role. 

In this section we shall denote by C(n) any cyclic group of order n. Let 
K be a cyclic extension of Q with G = Gal(K/Q) = C(p") where p is 
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prime, let G, be the unique subgroup of G of index p and k the fixed field 
of G,, so that k is cyclic over Q with Gal(k/Q) = G/G, = C(p). Let 
@ (which may be equal to p) be a prime which ramifies in k. The inertia 
group of @ for k is the entire Galois group G/G1; since any proper subgroup 
of G is in Gj, it follows from Theorem 16 that the inertia group of @ for 
K/Q is the entire Galois group G. Hence @ is totally ramified in K — 
that is, e = p” and f = g = 1. Such primes are the key to the following 
arguments. 


Lemma 41 Let K/Q be cyclic with G = Gal(K/Q) = C(p"). Then there 
are a cyclotomic field L and a field K’ with Gal(K'/Q) = C(p’) ands <r 
such that LK = LK' and the only prime totally ramified in K’/Q is p 
itself. 


Proof Suppose that @ # p is totally ramified in K; if there is no such @ we 
can take K’ = K. In the notation of §5, V is trivial because [K : Q] is 
prime to £; hence T = G is cyclic of order dividing ?— 1 by Theorem 15, so 
that £= 1modp". Now let L be Q(W/1); then KL is abelian over Q and 
its Galois group is a subgroup of C(p") x C(é—1). Let K, be the inertia 
field of £ for KL/Q. As before, the corresponding ramification group is 
trivial, so Gal(K L/K1) is cyclic; its order must be a multiple of £—1 by 
Theorem 27 applied to KL > L > Q. So Gal(KL/K,) ~ C(é — 1) since 
Gal(KL/Q) contains no larger cyclic subgroup than C(é— 1). A’ prime 
other than £ which ramifies in K, must ramify in KL and therefore in K, 
for only @ ramifies in ZL. No prime ramifies in K; 1 LZ; so Ki NL = Q, 
and now K,L = KL by a comparison of degrees. But Gal(KL/Q) has 
Gal(L/Q) = C(é—1) as a quotient, so it has the form C(p*) x C(€—1) for 
some s <r; and hence Gal(K1/Q) + C(p*) by Lemma 25. 

If K, does not have the properties required of K’, then we can repeat 
the process again on K,. This process must eventually end, because one 
fewer prime is ramified (not necessarily totally) in K, than in K. oO 


Lemma 42 (i) Let p be an odd prime and let K be a field cyclic over Q 
with Gal(K/Q) = C(p). If p is the only prime which ramifies in K then 
K is a subfield of the field of p*-th roots of unity. 

(ii) If [K : Q] = 2 and 2 is the only prime which ramifies in K, then K 
is Q(V2), Q(V=1) or Q(V—2), all of which are subfields of Q( V1). 


Proof After the results in §10, (ii) is trivial; so we need only consider (i). 
Let L = Q(#/1); then Gal(KL/L) * C(p) since K and L have coprime 
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degrees over Q. By a standard result in Galois theory, KL = L(a) where 
a? = f is in L. Choose a generator o of Gal(KL/Q) = C(p(p — 1)); then 
K is the fixed field of o? and L is the fixed field of o?~!. Thus o? 1g = 8 
and hence o?~'a = Ca where ¢? = 1; but ¢ ¢ 1 because we can assume 
that a is not in L. As in §13, we write r = 1—¢. Since o induces a 
generator of Gal(L/Q) there is a primitive root n mod p such that a¢ = ¢" 
and hence g?~!(ca) = o(a?-'a) = C"(ca); in other words, (ca)/a” is 
invariant under ¢?—! and is therefore in L, and hence (o8)/" is a pth 
power in L. 

It follows that (oa)/a is not in Z and therefore generates KL over L. 
The prime ideal (7) in Z is invariant under a, so (cf)/f is a m-adic unit. 
Replacing a by (oa)/a and § by (af)/, we can assume that £ is a m-adic 
unit, and by further replacing a by a!~? = a/f we can also assume that 
8 = 1mod7; this means that 9 = ¢*y where y = 1 mod 7? and therefore 


y=1+cen™" modr™* 

for some m > 1 and some c in Z prime to p. For 6 in the local field L,, 
Lemma 19 shows that 6 = 1modz7 is a pth power in Lj if and only if 
5 = 1mod7?. Since (ay)/y" = (c8)/f" is a p-th power in L and so also 
in Ly, 


oy = y" mod7?. 


But on = 1—¢" = n(1 —() mod 7? and therefore 


oy =1+cn™n™ mod r™*?, 
The last three displayed equations together imply that either m > p or 
cn™ = cnmodp, and the latter also requires m > p since m > 1 and nisa 
primitive root mod p. It follows that y = 1 mod 7?. 

To prove the lemma, it is enough to show that y is a pth power in 
L. Since K, = L( 2/7) is in K( */1), no prime other than p ramifies in 
K,/Q and K,/Q is abelian. But by Lemma 30 and the result above, (7) 
does not ramify in K,/L. Let k be the inertia field of p for K,/Q; then 
[K, : k] = p—1 and k = Q because no prime ramifies in k/Q. Thus 
[1 : Q] =p~1 and 7 is a pth power in L. O 


Theorem 44 Every abelian extension of Q is cyclotomic. 


Proof We prove this by induction on the degree of the extension. Let K be 
abelian over Q with Gal(K/Q) = G. If G is a non-trivial direct product 
G x G2 then K is the least field containing the fixed fields of G, and of Go, 
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which are abelian extensions of Q of degrees lower than that of K. Hence 
we may assume that G ~ C(p") where p is prime. In view of Lemma 41 
we can assume that p is the only prime which is totally ramified in K, and 
in view of Lemma 42 we can further assume that r > 1. 

Consider first the case when p is odd; let K, be the unique subfield of 
K with [K, : Q] = p and let L be the field of p"*!-th roots of unity. Since 
KOL2D K, by Lemma 42 and Gal(L/Q) = C(p"(p — 1)), 


H = Gal(KL/Q) = C(p"(p—1)) x C(p") for some r’<r (108) 


because H is a proper subgroup of C(p"(p — 1)) x C(p") which admits a 
C(p"(p — 1)) as a quotient group, Let H; = Gal(K L/L); then 


H/H, = Gal(L/Q) = C(p" (p — 1)); 


by (108) Hj is a direct factor of H. If H = H, x Hp then the fixed field 
of H2 has degree po <p” over Q, and KL is the compositum of it and L. 
Hence by the induction hypothesis K is cyclotomic. 

When p = 2 this argument needs some modification. The Galois group 
over Q of the field of 2"+?-th roots of unity is C(2) x C(2"), and hence this 
field has a subfield L with Gal(Z/Q) = C(2"). Now the argument used for 
p odd will still work, provided we show that the unique quadratic subfield 
of K is Q(/2); for since L satisfies all the conditions imposed on K, this 
will hold for L also. If K is real this is trivial. If K is complex it admits 
complex conjugacy as a non-trivial automorphism of order 2; all its proper 
subfields are fixed under this and hence are totally real. O 


Appendix 


Al Prerequisites 


In this section we prove the standard results about finitely generated abelian 
groups and lattices, and about norms and traces in field extensions of fi- 
nite degree, for the benefit of those readers who do not know them already. 
They are prerequisites for everything in this book. We also give a brief ac- 
count, without proofs, of the definition and key properties of Haar measure. 
Logically, this is not essential for the arguments in most of the book, but 
it underpins the point of view adopted. For a really single-minded reliance 
on Haar measure, see [Weil]. 


Al.1 Finitely generated abelian groups and lattices 
In this subsection, all abelian groups will be written additively. 


Lemma A1 Let G be a finitely generated torsion-free abelian group such 
that 


(i) G is generated by 21,... , Zn, 
(ii) G cannot be generated by less than n elements. 


Then there is no non-trivial relation a,21 +--:+@n2n =0 with the a, in 
Z. 


Proof Assume that there is such a relation, and among all sets of n gen- 
erators and all non-trivial relations as above choose that one for which 
A = |a;|+---+|@n| is least. We have two cases, either of which will give 
a contradiction. 


(a) At least two of the a, are non-zero. By permuting subscripts and 
changing signs we can assume that a; > a2 > 0. Now consider the 
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base 21,21 + %2,23,... for G. In terms of this base our relation 
becomes (a1 — a2)21 +.a2(z1 +22) +--- = 0, and this has a smaller 
value of A. 


(b) Only one a, is non-zero. We can take this to be a1, so that the relation 
is a,x, = 0. Since G is torsion-free, x, = 0; so G is generated by 
the n — 1 elements r2,... ,tn- 


These contradictions prove the lemma. O 


We call a base having the properties in Lemma Al a minimal base. 
Every finitely generated torsion-free abelian group has a minimal base, for 
we need only choose a base consisting of as few elements as possible. 


Lemma A2 Let G be a finitely generated torsion-free abelian group, and 


let H be a subgroup of G. Then there exist a minimal base x1,... ,In of G 
and integers m,,...,M, for some r <n such that 

(i) the m, are positive and m,|mMpi1 forp=1,...,r—1, 

(ii) m121,...,M,Zr are a minimal base for H. 


If H has finite index in G then r =n. 


Proof The proof is really by induction on n, but because of the form of the 
induction hypothesis it is better not to state it in that way. We can assume 
that H is non-trivial. Let yi,...,yn be any minimal base for G and let 
h = ayy; +--+ +@nyn be any non-zero element of H. Making an integral 
unimodular transformation on the y, cannot decrease the highest common 
factor of the a, and is an action which can be reversed; so it leaves the 
highest common factor unchanged. Now choose h to be a non-zero element 
of H for which this highest common factor is as small as possible, and then 
choose that minimal base y1,... ,Y¥n for G for which A = |a;| +--+ + |an| 
is as small as possible for this particular h. If two or more of the a, were 
non-zero, we could decrease A just as in the proof of Lemma A1; hence 
after permuting subscripts we can assume that a; = m,; > 0 and all other 
a, vanish. Let z = b1y; +--: + bny, be any element of H; then 


@ m\b; since if 0 < b} — cm; < m, for some c in Z then z— ch would give 
rise to a smaller highest common factor than h, 

© boye+::+ + bn¥n = z— (b1/m1)h is in H, 

@ m,\|b, for each v, for otherwise m,y, + beyo +---+ ban would give rise 
to a smaller highest common factor than h. 
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Now let G, be generated by yo,..- , Yn, let Hy = GiNH and write x, = y. 
We have shown that 


G={z,}OGi, H={m,2x}0 Aj. 


If we repeat the process on G; and Hj, we shall have m;|m2 by the last 
bullet point. We stop when H is exhausted. If r < n then all the ma, 
would generate distinct cosets of H, and H would not have finite index in 
G. QO 


Theorem Al Let G be a finitely generated abelian group. There exist 


elements x1,...,2n of G and integers my,...,m, for some r <n such 

that 

(i) Mptp =0 forp=1,...,r, 

(ii) each m, > 1 and m,|mpi1 for p=1,...,r—1, 

(iii) each element of G can be written uniquely as a12,+---+Qn2q_ where 
the a, are integers and0 < a, < m, for p=1,...,r. 


Moreover G uniquely determines n and the m,. 


Proof To prove existence, let y,,... ,yn be any generators of G, and let 
G* be the free abelian group on the N generators Y;,... ,Yn. There is 
a natural epimorphism G* — G obtained by mapping each Y, into the 
corresponding y,; let H* be its kernel. Since G* is torsion-free, we can 
apply Lemma A2 to G* and H%, obtaining a base Xj,... , Xn for G* and 
integers M,,... , Mp; let x, be the image of X, in G. The z, generate G, 
and 6)2; +--+ bnzy = 0 if and only if M,|b, for each p <r and b, =0 
for each p > r. Thus we have achieved all our claims except the statement 
that M, > 1. But if some M, = 1 the corresponding z, = 0, so that x, 
can be deleted from the base for G which we have just constructed. 

To prove uniqueness, suppose that we have a second such representation, 
given by dashed letters. We first prove that n =n’. For if not, let n > n’ 
and let p be a prime dividing m,. Using the undashed representation we 
have an obvious epimorphism from G to the n-dimensional vector space 
over F,; hence this space must be generated by the images of the z!,. This 
is absurd because the set which they generate contains at most p™ < p” 
elements. 

Now for any m > 0 consider the group mG consisting of all mz with z 
in G. This has the representation in the theorem if we replace the x, by 
mz, and the m, by m,/(m.,m), where the mz, for which m,|m must 
be deleted. Hence m, is invariantly defined by the property that mp is 
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the least m for which a canonical representation of mG uses at most n— p 
generators. QO 


Corollary Any subgroup of a finitely generated abelian group G is finitely 
generated. 


Proof As in the proof of the theorem, construct a finitely generated torsion- 
free group G* and an epimorphism G* — G. If H is a subgroup of G, let 
H™* be the inverse image of H. Then H™ is finitely generated, by Lemma 
A2, and the images of the generators of H* generate H. O 


Now let V be Q” or R” with the standard topology, and write k = Q 
or R respectively. A lattice A in V is a Z-module which satisfies any two 
of the following three conditions. 


(i) A spans V as a k-vector space. 
(ii) A is discrete in the topological space V. 
(iii) A is a free Z-module on n generators. 


Lemma A3 Any two of these three properties imply the third. 


Proof Suppose first that any n elements of A are linearly dependent over 
k. If S is a maximal set of linearly independent elements of A containing 
r <n elements, then the k-vector space generated by S contains A; but 
since it has dimension r it is not the whole of V, so (i) does not hold. 

We now use the pigeonhole principle. Let x1,...,2, be elements of A 
linearly independent over k, so that the z, span V; and let Ao be the Z- 
module spanned by the z,. Suppose that A is discrete in V; thus there 
is an integer M > 0 such that the only element >> A,z, of V with each 
|Av| < M-?! which lies in A is the origin. The M®” boxes of the form 
my/M < Av < (m, +1)/M, where the m, are integers with 0 < m, < M, 
cover the cube C defined by 0 < A, < 1 for each v. Let yi,...,yn be 
representatives of distinct cosets of A/Ao; by translation we can assume 
that they all lie in the cube C. Hence N < M™, for otherwise y,,y, would 
lie in the same box and y, — ys would be an element of A lying in the 
forbidden neighbourhood of the origin. Hence A is finitely generated (by 
the x, and y;), and A is a free Z-module on n generators by the last sentence 
of Lemma A2 with G = A and H = Ag. We have therefore shown that (i) 
and (ii) imply (iii). 

If (i) and (iii) hold, let {21,...,2,} be a base for A; then 2,...,2n 
span V and any point of V can be written as }*A,z, with A, in k. The 
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neighbourhood of the origin given by |A,| < 1 for each v contains no point 
of A other than the origin, so A is discrete in V. 

Finally, if (ii) and (iii) hold but (i) does not, let W be the k-vector space 
spanned by A. Then (i) and (ii) hold with W in place of V;; so (iii) holds 
with dim W in place of n, which is a contradiction. o 


A1.2 Norms and Traces 

Let K,k be fields with K > k and [K : k] = n finite, and let 2;,... ,2, be 
a base for K as a k-vector space. For any y in K the endomorphism of the 
k-vector space K given by z +> yz has a characteristic polynomial F(Y), 
Explicitly, there exist a,; in k such that 


YX; = S- ai52; (Al) 


and the characteristic polynomial of y is F(Y) = det(YI ~ A) where A 
is the matrix of the a;;. A change of base takes A into T~!AT and hence 
does not affect F; and F(y) = 0 follows by regarding (A1) as equations for 
the z;. We are particularly interested in the second and last coefficients of 
det(YI + A), which define the trace and norm of y: 


Trxe(y) = trace(A), normy,,(y) = det(A). 


If Ay, Ae correspond to 41, y2 respectively then A; + Az corresponds to 
yi + ye and A; A2 to y1y2; so we obtain 


Try: £y2) = Try: + Try2, norm(y:y2) = (normyi)(normy2). (A2) 


Lemma A4 Let K > k with [K : k] finite, let y be an element of K and let 
F,f be the characteristic polynomials of y for K/k and for k(y)/k. Then 
F= f™ wherem=([K : k(y)]. 


Proof Let x1,...,2m be a base for K over k(y). If [k(y) : k] = r then 
l,y,...,y"? are a base for k(y) over k, and therefore the z,,y? form a base 
for K over k. But if A, B are the matrices whose characteristic polynomials 
are F, f respectively, then A consists of m copies of B down the main 
diagonal, with zeros everywhere else. O 


Corollary Let y,,...,yn in the algebraic closure k of k be the distinct 
conjugates of y over k. Then |K : k] =ns for some integer s, and 


norm«/y = (yr+Yn)*, Trey = 8(yr +--> + Yn): (A3) 
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Ifo,,...,0n are the distinct embeddings of K into k which fiz k element- 
wise, then [K :k] = NS for some integer S, and 


norm «/-Y = [[(e», Treey = S>- (avy). 


If k(y) is separable over k then s = [K : k(y)]; if K is separable over k 
then S = 1. 


Proof The equations (A3) follow from the fact that the roots of f are the 
yy; possibly repeated, and they have equal multiplicity; if k(y) is separable 
over k then this multiplicity is 1 and n = [k(y) : k]. For each y, there 
is a homomorphism k(y) — k given by y + y,, and each of these can be 
extended to K in the same number of ways. O 


We could have defined norm and Trace by the formulae in this corollary 
— and this is what we had to do when we defined the norm of an ideal in 
§4. The crucial formulae (A2) follow at once and we do not need to mention 
the characteristic polynomial. But we would instead have to prove that the 
results lie in k — though this is trivial in the separable case, which is all 
that concerns us in this book. 


Theorem A2 Let K > L Dk be a tower of extensions, with (K : k] finite, 
and let y be an element of K. Then 


normy/,(normx;,y) = normy,~y,  Trese(Trxsty) = TrK/ey- 


Proof We use the notation of the last corollary. In the first equation, each 
side is the product of the y, each taken the same number of times; and this 
number is determined by the fact that each side is the product of [K : k] 
conjugates of y. For the second equation we need only replace ‘product’ by 
‘sum’. | 


A1.3 Haar measure 

A topological group is a group which, as a set, is equipped with a topol- 
ogy such that the group operations (multiplication and inversion) are con- 
tinuous. There is one major theorem about topological groups; and every 
serious mathematician needs to know the statement of it though not the 
proof. 


Theorem A3 Let G be a locally compact topological group. There exists 
on G a measure p, unique up to multiplication by a constant, such that 
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every compact subset of G is measurable and 
(90S) = u(S) 


for any go in G and any measurable subset S of G. There is a corresponding 
integral [, f(g)du with the property 


i f(909)dp = [ oe 


These are called the left Haar measure and left Haar integral re- 
spectively. There is a similar theorem for the right Haar measure and right 
Haar integral. Left and right Haar measure are frequently the same (for 
example if G is either commutative or compact); but this is not always so. 

The proof of this theorem is difficult; but one really uses it only as 
motivation. In the cases which occur in practice one usually needs to know 
the Haar measure explicitly and it is usually obvious what it is; but one 
hardly ever needs the uniqueness assertion. 

It is sometimes obvious how one should normalise the Haar measure; for 
example, if G is compact or has a particularly important compact subgroup 
Go then it is natural to choose p so that u(Go) = 1. Again, the natural 
normalisation of 4 on R is the one which induces u(R/Z) = 1; but there 
is no obviously best normalisation of the Haar measure on C. 

Now let K be a locally compact topological field. Associated with K 
there are two topological groups — K with the addition law and K* with 
the multiplication law — and the corresponding Haar measures p+ and p* 
are different. These are respectively the additive and the multiplicative 
Haar measures on K’. Let a be any non-zero element of K. The map 


SH p* (as) 
is an additive Haar measure on K, so it has the form S +> cgy*(S) for some 
Ca by the uniqueness of Haar measure. One frequently writes c, = |lall, 


thus defining a function from K* to the positive reals; note that ||a|| does 
not depend on the choice of y+. (By convention we write ||0|| = 0.) Clearly 


llo1a2|| = [loa] - lleall; 


and though ||.|| need not induce a metric on K, under modest extra con- 
ditions it induces a topology on K which is just the one we began with. 
Moreover 


[tear = f enerant (Ad) 


if we adjust the constants in p*, y* suitably. 
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The simplest cases of Haar measure are given by (i) to (iii) below; (iv) 
and (v) are crucial for applications to algebraic number theory. 


(i) If G is a finite group, or more generally if G has the discrete topology, 
u(S) is just the number of elements in S. If K is a finite field then 
ja|| = 1 for a £0. 

(ii) R is a field and the Haar measure on R is the standard one. Thus 
\|z|| = |x| and the Haar measure on R* which is given by (A4) is 
Sr fy |a|-dz. 

(iii) C is also a field. One possible Haar measure on C is given by area in 
the complex plane, but it is customary to double this, giving the 
measure associated with dz A dz. In either case ||z|| = |z|?; this 
does not give a metric, but ||z||!/? does. Again the Haar measure 
on C* is given by (A4). 

(iv) Let k be an algebraic number field and p a prime ideal of k; then ky 
is a locally compact field. We have ||a||, = (Normp)~™ if @ is in 
k* and p™||a. Every residue class mod p™ for fixed m > 0 has 
measure p+(o,)(Normp)~™. It might appear natural to normalise 
the additive Haar measure so that u*(o,) = 1, and this is what 
some writers do; but following Tate we shall normalize it so that 
t(op) = (Normd,)~!/? where 2, is the local different introduced 
in §8. The benefits of this will become clear in §A2.2 and in the 
Corollary to Lemma 26. The measure on V;, will be the product of 
the measures on the ky. 

(v) There are also several plausible ways to normalize the measure on kj. 
We choose to require 4*(o5) = 1, and to take the measure on 
J; to be the product of the measures on the k}. However, some 
writers prefer to take u*(o5) = (Normp — 1)/(Normp); if so, the 
set consisting of the units congruent to G mod p™ for any fixed unit 
8 and any m > 0 has measure (Normp)~™. For those who also 
take u*+(o,) = 1, the relation (A4) is preserved. 


A2 Additional topics 


This section falls into two parts. The first two subsections outline the 
theory of Fourier transforms on locally compact abelian groups, which is 
an essential foundation for §15. The third one is not used anywhere in this 
book, except for a passing reference at the end of §18; but I hope that the 
reader who pursues the advanced theory will find it useful. 
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A2.1 Characters and duality 


Denote by T the circle group |z| = 1 with the usual topology, and let G 
be any locally compact abelian group; then a character of G is defined to 
be a continuous homomorphism c: G > T. The following trivial remark 
will be used repeatedly. 


Lemma A5 Let x be a character on a compact group G; then 


_f WG) if x is trivial on G, 
[fw dis = { 0 otherwise. 


Proof We need only prove the second statement. Choose zo in G with 
x(zo) # 1 and write xz for x in the integral; then 


[ x(z) dp = 1 x(zx0) du = x(Z0) [ x(x) du, 


from which the statement follows. oO 


The characters of G form a multiplicative group, normally denoted by 
G; to write them additively, we use the identification T ~ R/Z. We can 
topologize G as follows. Fix a character co of G; then a base for the open 
neighbourhoods of co in G is given by the set of characters c on G such 
that 


le(g) — co(9)| < € for gin S 


where € > 0 and S is any compact subset of G. It is easy to check that 
with this topology G becomes a locally compact topological group and each 
element of G induces a character on G. But much more is true: 


Theorem A4 (Pontryagin duality ) We can identify the character group 
of G with G. If G is compact then G is discrete; if G is discrete then G is 
compact. If H is a closed subgroup of G and H* consists of those elements 
of G which are trivial on H, then H t is closed in G and there are canonical 
isomorphisms H = G/H* and G/H = H?. 


Corollary Any character on a closed subgroup of G can be extended (non- 
uniquely) to the whole of G. 


Note that if the G; are subgroups of G such that any neighbourhood of 
the identity in G contains some G; then for any character c on G we can 
find some G; on ae cis trivial. For let NV be the neighbourhood of 1 in 
T given by Rz > 4 ; then c~'W is a neighbourhood of the identity in G and 
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therefore contains some G;. But c(G;) is a subgroup of T contained in N, 
and must therefore be trivial because V contains no non-trivial subgroups. 
A set of such G; is called a set of small subgroups. 


We now find the character groups of some of the topological groups 
we shall encounter. All these calculations can be done in an elementary 
fashion, so that logically we shall not need Theorem A4. 

Consider first Z: now we can choose c(1) arbitrarily and define c by 
c(n) = c(1)", and any such c is continuous. Hence Z = T and by duality 
T= Z; it is easy to check directly that any character on T has the form 
z+ 2" for some n in Z by considering its kernel, which must be either the 
whole of T or a finite subgroup. Since 0, ~ Z”, its dual is 6, + T”. 

Let. k be an algebraic number field. In view of Lemma 26 we expect to 
give Q or k the discrete topology; and in fact there are no other sensible 
topologies which make these groups locally compact. We are not interested 
in all their characters, but only in those that extend to some completion; 
and these are best obtained by finding directly the characters on the possi- 
ble completions. Let k, be any completion of k and let Q,, be the closure 
of Q within k,, so that w is the valuation on Q which is the restriction 
of v. Qy must be either R or Q,. In either case there are natural maps 
tw: Qu > R/Z, being given for Q, = Q, by Q, — Q,/Z, — R/Z be- 
cause the elements of Q,/Z,p can be identified with those elements of Q/Z 
whose denominators are powers of p. Using ty we can define the function 
z+ exp(27iz) for any t in Q,. In the following lemma the signs have 
been chosen so as to simplify the corresponding statement in the adélic 
case. 


Lemma A6 We can identify ky with ky, the character cq corresponding 
toa ink, being given by 


Ca(8) = exp(+2mt Trx, /Q,,(@8)); 


where: the sign is chosen to be minus for Archimedean and plus for non- 
Archimedean. places. 


Proof Most of this assertion is easy. With the definition above, cg is a 
character; for it is a homomorphism, and it is continuous because Trace 
is so. The cg form a group algebraically isomorphic to k, under a+ cg, 
for cq trivial implies that Tr(a@) is in Z for all @, which is only possible 
if a = 0. Moreover the group of cg is homeomorphic to ky under this 
correspondence. For let Sy be the compact set |||, < M where M is 
fixed; then ||a||, small implies Tr(a) small uniformly in 6 and hence ca 
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is close to the identity character. Thus a ++ cg is continuous. Conversely, 
suppose that cq is close to the identity character. If w is Archimedean this 
means that |Tr(af) — ngl < € for each @ in Sy and some ng in Z, where 
we require € < ; If ng is ever non-zero then by replacing @ by Gx where 
|z| < 1 and zng is not close to an integer, we obtain a contradiction. So 
|Tr(a@Q)| < € for all G in Sy, and this implies that a is small with e. If 
instead w corresponds to p then |Tr(af) — ng| < € where this time ng is 
in Z, and we shall require « < $i if Tr(a@) is ever not in Z, then by 
replacing @ by p™@ for suitably chosen m we obtain a contradiction. Now 
take M large; if cg is close enough to the identity character, af must be 
in 0~! for all 6 with ||G||, < M, where 0 is the different, and this implies 
that ||a||, is small. 

It remains to show that we have found all the characters of k,. But the 
characters which we have found certainly form a closed subgroup H# in the 
notation of Theorem A4; and if this is not the whole of k, then there is a 
non-trivial subgroup H C ky on which H? is trivial. Choosing B #0 in H 
we obtain a contradiction. O 


Consider next Vz. Let S be any finite set of places, including all the 
Archimedean places, and let Gs be the subgroup of V;, consisting of those 
adéles [] a, for which a, = 0 for v in S and a, is in oy for all v. The Gs are 
a set, of small subgroups in the sense of the remarks below the Corollary to 
Theorem A4, so any character c on V; is trivial on some G's, say Gs,. Let 
Cy denote the character induced on k, by the natural embedding k, — Vy; 
thus cy, is trivial on 0, for almost all v. Since any @ in V; is the sum of an 
element in Gs, and a finite set of elements of various k,, 


e(8) = |[o(4.), 


where almost all the factors on the right are 1. Conversely, if we have any 
set of characters c, on the k, such that c, is trivial on 0, for almost all 
v, this formula defines a character on V;. This determines the algebraic 
structure of Vi. As for its topology, any compact set in V, is contained in 
I] S. where each S, is compact and almost every S, is 0,; hence a base 
for the open neighbourhoods of the trivial character is given by the sets 
[[ Cu, where each C,, is an open neighbourhood of the trivial character in ky 
and almost every C, consists of all characters which are trivial on o,. If v 
corresponds to a finite unramified prime p, such a character corresponds to 
an element of 0, under the identification in Lemma A6. We have therefore 
proved 
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Lemma A7 There is a natural identification YK = Vp. 


Explicitly, the character cq corresponding to a = [] a, in Vj is given by 


Ca UI Bu) = exp (2ni So (fewT rk, /Qu (206.))) (A5) 


where as before w is the valuation of Q which is the restriction of v, and t, 
maps Q, to R/Z. As in Lemma A6 the sign is plus for non-Archimedean 
and minus for Archimedean places. One useful consequence is that if a, 8 
are both in VY, then 


Ca(B) = 1 (a). (A8) 


Corollary The character cq is trivial on k if and only if a is in k. 


Proof Suppose first that a, @ are both in k. By (39), 
boo Y > Tre, /Qe (8) = D> tp D> Tre, /Q, (28) 


Pp pip 
where the sum on the left is taken over all Archimedean places; for each 
side is equal to the image of Tr,;q(af) in Q/Z. Thus the sum on the 
right of (A5) is 0. (It was for this reason that we introduced the unnatural 
looking sign convention in Lemma A6.) Now let G be the additive group 
of all adéles a = [[a, such that cq is trivial on k. We have shown that 
G > k, and G is a k-vector space; thus G/k is a k-vector space which is 
a subspace of V;,/k. But the latter is compact, by Lemma 26, so G/k is 
trivial. | 


We shall not need to know 5 in what follows; and this is just as well 
because in general no satisfactory description of it is known. But we shall 
need some information about its quasi-characters — that is to say, its 
continuous homomorphisms into C*. In particular if x is a quasi-character 
so is a+ |/al|\*x(a@), where the function ||.|| is that defined after the proof 
of Lemma 27. 


A2.2 Fourier transforms 


Let f(g) be a complex-valued function on a locally compact additive group 
G, and let » be a Haar measure on G. Suppose that f is continuous and 
integrable on G. The Fourier transform of f is the function on G given 
by 


fo = i. f(9)e(g)du. 
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The complex conjugacy sign appears here for historical reasons; however, 
complex conjugacy is bound to appear somewhere in the theory. The im- 
portance of this definition comes from the duality relation: 


Theorem A5 There is a Haar measure ji on G such that 
Jo) =f-a) = f floetoyan 
provided that the integral evists. 


Since G only determines the Haar measure on it up to multiplication, to 
find the correct j for any particular G we need to verify the result for one 
particular function f. If we can identify G with G then we can normalize 
us; for if we multiply » by a constant we have to divide f by the same 
constant, and there is a unique positive multiplier for which £2 is equal to 
pt. We shall show below that if G is R,C or k, then the measure p which 
was chosen in §A1.3 is the one which achieves this. It will then follow that 
the same statement holds for Vj. 

The modern theory of Fourier transforms recognizably includes the classi- 
cal analytic theory — though the classical results were proved under weaker 
conditions on f. There are three important special cases in classical pure 
mathematics: Fourier series, Fourier integrals, and Mellin transforms. The 
first two are straightforward. Since Z=-Tx R/Z, Theorem A5 gives 


fo) 1 
f(z) a caer => G= i: f(z)e~?™*"* dz. 

66 0 

Next consider R ~ R; then 
‘OO oO 
fay= f saererdr > lay= [flyer ay. (AN) 

Thus in this case » = 2. The easiest way to check that the multiplicative 
constant is correct is to note that f(x) = enm implies f(y) = enty 


Classically, the Fourier duality formula for C was not important enough 
to warrant a name. It asserts that if 


f(w) = ih f(z) exp(27iR(zw)) dz A dz 
then (at least up to a constant factor) 


f(z) = | i f (w) exp(—27iR(zw)) dw A dw. 
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To check that the factor here is correct, take f(z) = g1(zV2)go(yV2) where 
z=a2+iy; if w =utiv then f(w) = §(uV2)92(—-vV2) where gi, 92 are 
given by (A7). 

To check that ~ = py for k, take 


se) ={ 


1 for z in oy, 
0 otherwise; 


then by Lemma A5 


fu) = [ exp(-2niT, a, av) dud 


Pp 
_ f (Normd,)-"/? for y in 05", 
= 0 otherwise , 


because the integrand in the middle expression is a character on o,. Simi- 
larly we obtain 


[ i(v) exp(2xi Tre, 9, (2) dud = f(a) 


since p+ (0,51) = (Norm?,)y*(o,). Thus f = p. 


The Mellin transform formula is derived from (A7) exactly as in the clas- 
sical theory. For let g(z) = SY? an exp(2minz) where the a, are complex 
numbers such that an = O(n?—!£) for some real o and some ¢ > 0; this 
series is absolutely convergent in the upper half-plane Rz > 0, and g(z) is 
periodic with period 1. If L(s) = }>ann-* where s = o + ir then 


P(s)(2m)"*L(s) = } an [ Te 2y Ny = [ * gliv)yt dy. 


But this simply says that the left hand side, considered as a function of 
T, is the Fourier integral transform of 27e?"*’g(ie?"*). After some tidying 
up, the dual formula becomes 


ativ) = 5 [ T(s)(2nv)*L(s)ds 


where the integral is taken along the vertical line Rs = c. 


Theorem A6 (Poisson Summation Formula) Let H be a discrete sub- 
group of G such that G/H is compact, and let H* consist of the elements 
of G which are trivial on H, so that Ht = G/H. Then 


u(G/H) )> f(b) = >> f (at) (A8) 
H Ht 
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provided that f is integrable on G, )- 1, f(g +h) is absolutely convergent 
uniformly in g and S~ f(h") is absolutely convergent. 


Proof H' is discrete and G/H 4 is compact, by Theorem A4. Define the 
function ¢(x) on G/H by ¢(z) = oy f(x +A); then 
[ e@dus [sods d= [oat aan. (AD) 
G/H G G/H 
Now Theorem A5 gives 
$(2)“(G/H) = S72 bY) 4(h) 
Ht 


up to a constant factor. To see that the constant is correct, set #(z) = 1; 
then 4(1) = u(G/H) and ¢(h") = 0 otherwise, the latter result. coming 
from writing x29 for x in the second equation (AQ) where h!(z9) 4 1. Also 


B(ht) = : | SER = [ i(9)Ria)du = f (at), 


the change in the order of summation and integration being justified by 
the hypotheses on f. Hence 


$(2)u(G/H) = > h¥(a) f(b), 
and writing z = 0, (0) = >> f(h) gives the theorem. O 


If we multiply 4 by a constant, we multiply both sides of (A8) by the 
same constant; so the theorem is true for any choice of y. The special case 
which will be needed in §15 is as follows. 


Corollary Let f(€) be a function on V_ which is integrable and for which 
oe in eS(@(x+&)) is absolutely convergent for all adéles € and idéles a, 
uniformly in € for each a. Suppose also that >>. in & f(az) converges for 
each a, where we have identified Ve with V,. Then 


w(Vi/k)\lal| S> f(az)= S> fanz). 
zink zink 
Proof Write 9(€) = f(a&); then 
atm) = [ o€)entau= [fas \erCEn)an 


= jal) rh I(@ela-"En)dp = |e? flan), 
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where to go from the first line to the second we have written a~1€ for &. 
Now apply the theorem to g(&) with G = V, and H = k, and use the fact 
that H! = k by the Corollary to Lemma A7. oO 


If we identify Ve with V,, the symmetry property of the Fourier transform 
becomes 


i(n) = i: f@alemd = f= i F(n)ea(én)du — (A10) 


where y is normalized by the condition u(V,/k) = 1. For we know from 
Theorem A5 and (A6) that f(é) = A f, f(n)cx(€n)dy for some constant A 
depending on p. Applying Theorem A6 to both f and f and remembering 
that c1(&€n) = c1(—€n) we obtain A(y(V_/k))? = 1; so A = 1 is equivalent 
to u(Ve/k) = 1. 


A2.3 Galois theory for infinite extensions 


Let K be a separable normal extension of a field k, with Galois group G. 
If [K : k] is finite, the fundamental theorem of Galois theory states that 
there is a one-one correspondence between fields L with K > LD D> k and 
groups H C G, given by the relations 


L consists of those elements of K which are fixed under H, 
H consists of those elements of G which fix L elementwise. 


The usual proofs of this depend on the finiteness of [K : k], because they 
use counting arguments; so they collapse when [K : k] is infinite. It is 
natural to ask how much the conclusion needs to be changed in that case. 


Lemma A8 Let K, L be fields such that K > LD k and K is Galois over 
k. If H is the Galois group of K over L, then the fixed field of H is L. 


Proof This follows directly from the finite case. For let € be any element of 
K not in £ and let & with K > R DL be the splitting field of the minimal 
polynomial of € over ZL. There is an Z-automorphism of & which does not 
leave € fixed, and this can be extended to an L-automorphism o of K with 
the help of Zorn’s Lemma. Since o is an element of H, the fixed field of H 
cannot be larger than L. O 


There are in general too many subgroups H of G for them all to be Galois 
groups of K/L for some intermediate field L; so how do we identify those 
H which have this property? To answer this, we introduce a topology on 


G. 
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Let K, run through those subfields of K which are normal over k and 
for which [K; : k] is finite, and let 


GD G; = Gal(K/K;), 5; = G/G; = Gal(Ki/k). 


If K; > Kj there is a natural homomorphism ¢,; : S$; + $;. The $; form 
an inverse system of groups with connecting homomorphisms ¢,;;, whose 
inverse limit is G with the obvious maps ¢; : G — S; = G/G;. For let o 
be any element of G. Each € in K is in some K;; and the actions of o and 
gio on € are the same. Hence o is uniquely determined by a knowledge of 
all the dia. 

The S; are finite groups, so we give each of them the discrete topology; 
and we give G the topology induced on it as an inverse limit — that is, 
the finest topology such that each ¢; is continuous. A base for the open 
subsets of G is given by the cosets of the G;, and any such set is also 
closed because it is the complement of the union of the other cosets of the 
same G,. The topology obtained in this way is Hausdorff, and with it G is 
compact. However, it is not known, even when G = Gal(Q/Q), whether 
every subgroup of finite index in G is open; this is one of a number of 
related unsolved questions, all of which appear to be very difficult. 


Lemma AQ Let H be a subgroup of G. Then H is the Galois group of K 
over some field L with K > LD k if and only if H is topologically closed. 


Proof Suppose first that H = Gal(K/L). Ifo is in the closure of H and 
€ is any element of L, we have to prove that of = €. Let Ko C K be the 
splitting field for the minimal polynomial of € over k; then Ko is Galois 
over k and [Ko : k] is finite. Let Go, So and ¢o be as above. Now @oH C So 
is closed because So is discrete, and it leaves € fixed; hence o belongs to 
the closed set do 60H and leaves € fixed. 

Conversely, if H is closed let L be its fixed field. If o is not in H, we 
have to show that there are elements of Z which are not fixed by o. There 
is a basic neighbourhood N of o which does not meet H — that is, there 
is a K, finite and Galois over k with ¢: : G — S; = Gal(K,/k) such that 
1H does not contain ¢,0. Let R, C Ki be the fixed field of ¢,H C S$}. 
Then A; C L because f, is fixed elementwise by H; but A; is not fixed 
under oc. O 


Putting the last two lemmas together, we obtain the Fundamental ‘The- 
- orem of Galois Theory for arbitrary extensions (which contains the corre- 
sponding one for finite extensions): 
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Theorem A7 In the notation above, there is a one-one correspondence 
between fields L with K > L Dk and closed subgroups H of G. 


There is one case in which we know how to describe G explicitly: that is 
when k is a finite field F, and K = k is its algebraic closure. Now there is 
just one extension of k of each finite degree n, which we call K‘). Since 
§() = Gal(K(™ /k) is cyclic of order n and is generated by the Frobenius 
endomorphism o : x + 2%, we can identify S$) with Z/(n) by identifying 
o” with vy modn. Here K(™ 5 K™) if and only if n|m, and the map ¢m,n 
is then identified with the map Z/(m) — Z/(n) which sends v mod m intc 
vy modn. 

If m,n are coprime, the degree of the compositum K (m) K(") must be a 
multiple of both m and n, whence K( K( = K(™), The natural mar 
gimn) _, 9(™) x §(”) of Galois groups is one-one and can be identified with 
the natural map of residue groups; and it is an isomorphism. In forming 
G, the inverse limit of the S$‘), we can therefore treat the different prime 
factors of the n separately; thus G is naturally isomorphic to the product 
of the inverse limits of the S(") ~ Z/(p”). These inverse limits are just 
the Zp. 


Exercises 


Chapter 1 


1.1 For the ring R = C/z, y], which has unique factorization, prove that 


(i) apart from (0) and R itself, there are two kinds of prime ideals, one 
kind being maximal and the other not, 

(ii) any non-zero ideal can be written in at most one way as a product of 
prime ideals, 

(iii) there are ideals which cannot be written as a product of prime ideals, 

(iv) division by ideals, even when possible, is not always unique. 


[Think geometrically. The two kinds of prime ideals correspond to points 
and irreducible curves.] 


1.2 Show that neither of Z[/—6] and Z[V/5] is a unique factorization 
domain. . 

[Consider the factorization into irreducible elements of 6 in the first. case 
and 4 in the second.] - 


1.3 Find a unit in Q(/6) and show that this field has class number 
h=l. 

1.4 Find a unit in Q(./22) and show that this field has class number 
h = 3. Deduce that X? + 22Y3 + 3Z% = 0 has no rational solutions. 

[If x,y,z is a solution with x,y coprime integers, show that p3 is not 
principal, where p3|3, but that (z+ y¥/22) = p3a®. The equation is in fact 
soluble in every Q,, but to prove this requires ideas not in this book; see 
for example [Ca].] 

1.5 Suppose that the extension K/Q is normal and has a Galois group 
which is simple but not cyclic. Show that there is no rational prime p such 
that (p) remains prime in K. 
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1.6 Show that the ring of all algebraic integers is not Noetherian, and 
deduce that Q is a Noetherian ring not all of whose subrings are Noetherian. 

[Fix a prime p and let M,, be the ideal generated by p'/™. show that the 
M, form a strictly increasing sequence.| 


1.7 Suppose that k contains ¢, a primitive p-th root of unity where 
p is prime, and that K is Galois over k with [K : k] = p; and write 
G = Gal(K/k) = C,. Show that K = k(#/a) for some a in k. 

[Let o be a generator of G. Take a = yet ee -o"® for @ in K; and 
show that one can choose @ so that a # 0.] 


1.8 Let K,k be algebraic number fields with K totally complex, k totally 
real and [K : k] = 2. If zx denotes the group of roots of unity in K, show 
that [O% : ofux] = 1 or 2. 

[The non-trivial automorphism of K’/k is complex conjugacy. Show that 
n ++ n/f induces a homomorphism 0% — wx/p% the kernel of which 
contains oj. Conversely, show that if a is in the kernel then a/@ = 
¢? = ¢/¢ for some ¢ in px; 80 a/¢ is in of] 


1.9 Show that the only integral solutions of X? = Y? + 13 are given by 
X =17, Y = +70. 

[Factorize the equation in Q(./—13), which has class number 2. Show 
that if x,y is an integer solution then (y + /—13) must be the cube of an 
ideal and hence y + /—13 = (a + bV—13); thus 1 = b(3a? — 13b?).] 


1.10 Let & be an algebraic number field. Show that there is a finite 
set of prime ideals p;,... ,p, with the following property: if R is the ring 
consisting of those elements of k which are integers except perhaps at the 
pi, then & is a principal ideal domain. 


1.11 Let ¢” = 1 and assume that a = (7/2, ¢™)/m is an algebraic 
integer. Show that either a = ¢™ for each i or a=0. 


1.12 Let X be an indeterminate. Show that the ring Z[X] is Noetherian 
and integrally closed in its field of fractions, but is not a Dedekind domain. 


1.13 Let k be an algebraic number field. Show that o, is a principal 
ideal domain if and only if it satisfies the following condition: for every a 
in k but not in o, there exist G,y in og such that 


0 < |norm,;aq(af — 7)| < 1. 
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Chapter 2 


2.1 If K, k are algebraic number fields with K > k prove that the relative 
discriminant of K/k is normx/40K/k- 


2.2 Show that X? — 82Y? = +2 has solutions in every Zp but not in Z. 
What conclusion can you draw about Q(vV82)? 


2.3 Give an example of finite extensions K1, K2 over Q such that 
[K, Ke : Ki} # [Ke : Ky M Ka]. 
[Thus without the Galois condition, nothing is left of Lemma 25.] 


2.4 Show that the class of the relative discriminant for K/k in the ideal 
class group C, is a square. 
[Argue locally, using the fact that Dx @, op is a free o,-module.] 


2.5 Let K Dk, let $B be a prime ideal in K and let p be the prime ideal 
of k divisible by %. Show that is wildly ramified if and only if Tr Keg /kp & 
is in pp for every a in Dg. 


2.6 Let @ be a prime and let f(X) = X* — aX — b be an irreducible 
polynomial in Z[X] for which (@— 1)a and £b are coprime. Let K be the 
splitting field of f(X) over Q. Show that 


(i) if p ramifies in K/Q then ep = 2, 

(ii) G = Gal(K/Q) is S¢, the symmetric group on @ elements, 

(ili) if k is the fixed field of the alternating group Ag, then K/k is unram- 
ified. 


[Since X f'(X) — £f(X) is linear and not divisible by any prime, any re- 
peated factor of f(X) must be linear and have multiplicity 2. Now Theorem 
19 implies (i) and shows that the non-trivial element of the ramification 
group of p is a transposition. Since G contains an element of order ¢ and 
a transposition, this gives (ii); and (iii) follows by applying Theorem 16 to 
the ramification group of p.] ; 


2.7 Prove that if p is unramified in Ky /k and in K2/k, then it is unrarn- 
ified in K,.K2/k; and that if p splits completely in K,/k and K2/k then 
it splits completely in K,K2/k. Show however that if p remains prime in 
K,K2/k then [K,K2 : k] is the least common multiple of {Ki : k] and 
[K = k}. 

Give an example where p is ramified in K;K2/Q but not in K,/Q or 
K2/Q. 
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[For the last part, take p = 3 and Ky, K2 to be conjugate fields Q( ¢/m).] 


2.8 Let K = Q(V7, V13). Show that for any integer a in K, the dis- 
criminant of 1,a,a?,a° is divisible by 3; and deduce that 1,a,a?,a° can 
never be a base for the integers of K. 

[Use the fact that (3) splits completely in K, and therefore 3|(a? — a).] 


2.9 Show that x* +1 is reducible in Q, for every p > 2. 


Chapter 3 


3.1 Let p =O, +G,! where ¢, is a primitive n-th root of unity. What 
additional condition on n is needed for the ring of integers of Q(p) to be 
Zp}? 


3.2 Let p be a prime and m an integer such that p|(m” — 1) forv =n 
but not for any smaller vy > 0. Show that p has at least one prime factor 
with e = f = 1 in Q(¥1) and deduce that p = 1modn. Hence show that 
for any n there are infinitely many primes p = 1 modn. 


3.3 Let K = Q( 4/1) with n = p.---pm, where the p, are distinct odd 
primes, and let K,, be the field of (n/p,)-th roots of unity. Let o, be a 
generator of the cyclic group Gal(K/K,), write o = o1---om and let L 
be the fixed field of 0. Show that K/L is unramified at each finite place. 
What additional condition is needed to ensure that it is also unramified at 
the infinite places? 


3.4 (i) Let G be a finite abelian group. Show that there are fields K, L 
with K = Q(¥/1) and L C K such that Gal(K/L) ~ G and no (finite 
or infinite) place ramifies in K/L. [Use the results of the two previous 
exercises. 

(ii) Using Theorem 37, show that the ideal class group of L contains a 
subgroup isomorphic to G. , 


The following two exercises illustrate the method of infinite descent, and 
show the importance of choosing the right equation to apply the method 
to. : 


3.5 Show that X44 Y4 = Z? has no non-trivial solutions in Z. 
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[If there are solutions, let x,y,z be the one with |z| minimal. Without 
loss of generality, assume y odd; then x? = 2uv, y? = u? — v? with u,v 
coprime. Thus u = A?, v = 2B? and y? = A‘ — 4B4. Hence show that 
A? = (74 m?, B? = ém with £,m coprime and therefore both squares. 
Now |A| < |z| gives a contradiction.] 


3.6 Show that «X4 + Y* = Z? has no non-trivial solutions in Z[/—1] 
where ¢4 = 1. : 

[Let 7 = 1+ V-1. Show by z-adic arguments that it is enough to 
consider the case when 7 divides X but not Y or Z, and that then 1?|X. 
Deduce that we can take Z—Y? = r7e,u*, Z+Y? = 1~equ4 where €1, €2 
are roots of unity; and hence Y? = 1~4e3u4 + equ* where |v. Writing 
Y /—1 for Y if necessary, show that eg = 1. Verify that the same power of 
m divides v and X, and derive a contradiction.] 


3.7 Repeat the investigation at the end of §14 for the case when K is 
the splitting field of an irreducible non-normal cubic equation over Q. 


3.8 Verify equation (46) by elementary means. 

[The only substantial calculation concerns the powers of 2; here it pays 
to split cases, using the properties both of the different and of a base for 
Ox] 


3.9 Let k = km be the field of m-th roots of unity and ko its maximal 
totally real subfield; and let yz, be the group of roots of unity in k. 


(i) If m is a prime power or twice a prime power, show that of = of, pk. 
(ii) If not, show that [of : of, 44] = 2. 


[Use the result of Example 1.8. For (i) let m = p” or 2p’. If the result 
is false and p > 2, there exists 7 in of with n? in ko but 7 not in ko; 
then k = ko(,/7) would not be totally ramified at p. If p = 2 and 7 in 
oj is not in of we then ¢ = 7/7 is a primitive m-th root of unity; then 
norm, /,¢ = +V—I1 but norm,,z7 is a power of /—I where L = Q(/—1). 
For (ii), show that 7 = 1—¢ is a unit, where ¢ is a primitive m-th root of 
unity and deduce that the homomorphism of Exercise 1.8 is onto.| 
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Chapter 4 


4.1 If the Dirichlet series }‘a,n~* converges at s = so, show that it 
converges whenever Rs > Rsg and that it converges absolutely whenever 
Rs > Rsy +1. 

{For the first assertion, use summation by parts.| 


4.2 If the limit (98) exists, show that the limit (99) exists and has the 
same value. 
[Use summation by parts.] 


4.3 Let S be a finite set of places and let x be a character on I* such 
that for any neighbourhood N of 1 there exists € > 0 such that the inverse 
image of N contains all principal ideals (€) with {|€ — 1], < € for each v in 
S. Show that x satisfies (91). 

[Choose the n,, $y so that (91) holds for all units with || —1||) < €; then 
proceed as in the proof of Lemma 34.] 


4.4 Let x be a non-trivial character of conductor m; show that 


x(n) 


for all integers N. Deduce that if p is prime and x is a non-trivial character 
mod p then x(n) # 1 for some n < p/?(1 + logp). 
[If tz (x) is the Gauss sum as defined in (54), show that 


< m/2(1 + logm) 


N N 
Yo x(n) = (1K) DS YS X(b) exp(2mibn/m) 
1 n=1bmodm 
bohpie ecehes exp{2mib(N + 1)/m}—-1 
= (1%) 2a exp{2mib/m}-1  — 


In each term of the last sum, the numerator is absolutely bounded by 2 
and the absolute value of the denominator is {2 sin(3b/m)|. Hence 


[>> x(n)| < 2m-*/? § © cosec(xb/m) 


where we can take the sum to run over 0 <b< $m.] 
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Chapter 5 


5.1 Write i = /—1 and let K = Q(i, ¥2), ky = Q(i) and kz = Q(V/2). 
Show that K is abelian over each of k, and ka, with Gal(K/k,) ~ Cy and 
Gal( K/k2) i C2 x C2. 


(i) If H is the congruence divisor class associated with K/k;, show that 
its conductor divides (1 +7)’ and that H contains i and 5, and 
that these facts specify it completely. 

(ii) If H is the congruence divisor class associated with K/k2, show sim- 
ilarly that the finite part of its conductor divides (V/2)° and that 
H contains 3, and that these facts specify it completely. 


[In (i), every a = 1 mod (1+2)? is a (1+%)-adic fourth power and (1+) 
is the only prime which ramifies in K’/k1; these give the assertion about the 
conductor. Since the units represent all the odd residue classes mod (1+2)3, 
we need only identify the residue classes mod(1 +)’ in H which are 
congruent to 1mod(1+ i). This appears to give six candidates for H, 
but four can be rejected because they are not invariant under complex 
conjugacy. One of the remaining two contains 5 and the other contains 
1 + 4%; but the second must be rejected because (1 + 42) does not split 


completely in K/k;.] 


5.2 Let X,... ,X4 be integers not all divisible by any p = 7 mod 8, such 
that 


Xft + 4X} = (XP - 2X1 X2 + 2XF) (XP + 2X1 Xq + 2X3) = X? - 2X?. 


Show that |X3| #5 or 7mod8. 

[Assume that X3 is odd and show that 4|X2. Using the notation and 
restilts of the previous exercise, show stitcéssively that the ideal (X?+2i.X2) 
is in the kernel of the Artin reciprocity map for K/k,, that the first degree 
primes in this kernel are those whose Norms split completely in K, and that 
therefore the ideal (X3 + X42) is in the kernel of the Artin reciprocity 
map for K/k2. Hence deduce that X4 is even and |X3| = 1 or 3mod8.] 


5.3 Let k be totally complex and ko totally real with [k : ko] = 2. If 
h, ho are the class numbers of k, kg respectively, show that ho|h. 

{Let ZL be the Hilbert class field of kg. Show that LN k = kp whence 
[kE: k] =[L: ko] = ho, and that kL/k is unramified and abelian.| 


5.4 Let (7) denote the quadratic residue symbol in Z, where m,n are 
coprime and n is odd and positive. Using the general properties of the 
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power residue symbol, show that (>) and (2) only depend on nmod 8, 
and hence deduce the auxiliary laws 


(DS Enh, ene 


It is possible to derive Theorem 25 in the same way, but this involves 
an unnatural case-by-case calculation; a somewhat less ugly version of the 
same calculation can be found in the next exercise. 


5.5 (i) Let m = 2 and k = Q in the machinery of §19. Show that the 
Hilbert symbol (a, b)2 depends only on a and b mod (Q$)?, and construct 
a table of its values. [Choose particular representatives for the classes of a 
and b and use (104) and Lemma 38.] 


(ii) Check the results of (i) by means of (105). 
(iii) Deduce the law of quadratic reciprocity for Q. 


5.6 Take p = 3 in the notation of the exercise on page 111. Show that 
any a@ in k* can be written in essentially one way in the form a = C47’ ag 
with ag = +1 mod 3. Prove that 


a) 

Bo ag 

if each of ag, Jo is congruent to +1 mod3 and S(ag) M S(@9) = S. Prove 
also that 


if ag = +(1 + 3(r + 8€)). 


Suggested further reading 


There are a substantial number of texts which cover more or less the same 
material as Chapters 1 and 2. The most thorough and careful is 

[FT] A. Fréhlich and M.J. Taylor, Algebraic Number Theory, Cambridge 
University Press, 1991. 

The texts which I have found most inspiring are 

[Weyl] H. Weyl, Algebraic Theory of Numbers, Princeton University Press, 
1940, 

[Ca] J.W.S. Cassels, Local Fields, Cambridge University Press, 1986. This 
goes far wider than its title suggests, with a particularly strong emphasis 
on applications to Diophantine equations. 

A more comprehensive account of the local theory, both elementary and 
advanced, is 

[Se] J.-P. Serre, Local Fields, Springer-Verlag, 1979. (The original French 
version was Corps Locaur, Hermann, 1968.) This can fairly be described 
as a masterpiece. 


The best reference for class field theory (covering both local and global 
fields) is still 

[CF] J.W.S. Cassels and A. Frohlich (editors), Algebraic Number Theory, 
Academic Press, 1967. 

Alternative accounts can be found in 

[N] J. Neukirch, Class Field Theory, Springer-Verlag, 1986; 

[L] S. Lang, Algebraic Number Theory, Springer-Verlag, 1986. This was 
originally written in 1970; it also covers the elementary theory and a sub- 
stantial amount of analytic material. 

An interesting and recommendable illustration of how much can be achieved 
without overt Galois cohomology is the second half of 

[Weil] A. Weil, Basic Number Theory, Springer-Verlag, 1973. The first half 
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of this book is an account of the elementary theory, dominated by the use 
of Haar measure; this is an interesting approach, but Weil’s style does not 
help the reader. 

The best account of Iwasawa theory is to be found in 

[Wa] L. Washington, Introduction to Cyclotomic Fields, Springer-Verlag, 
1980. 

Books on the computational aspects of the subject become out of date as 
new packages are developed. Subject to this, the definitive account is 
[Co] H. Cohen, A Course in Computational Algebraic Number Theory, 
Springer-Verlag, 1995. 

A good introduction to these aspects can be found in 

[Sm] N.P. Smart, The Algorithmic Resolution of Diophantine Equations, 
Cambridge University Press, 1998. 
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